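If you wish to attend, please register as a member on the SIGMOD home page
( http://www.sigmodj.is.uec.ac.jp/ )
(membership is free; no action is needed if you are already registered),
and then send the application form to
sigmodj_lecture@tkl.iis.u-tokyo.ac.jp.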
Kitsuregawa (University of Tokyo, SIGMOD-J Chair)
Contact: ACM SIGMOD Japan Chapter
sigmodj_lecture@tkl.iis.u-tokyo.ac.jp
http://www.sigmodj.is.uec.ac.jp/
8<------------------------------------
To: sigmodj_lecture@tkl.iis.u-tokyo.ac.jp
ACM SIGMODJ November 26 Lecture: Application Form
Name:
Affiliation:
e-mail:
8<------------------------------------
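☆ Lecture ====================================================
Title: Clustering Web Documents
Speaker: Prof. Mukesh Mohania (IBM India)
Date and time: November 26, 5:30 p.m. - 6:30 p.m.
Venue: Institute of Industrial Science, The University of Tokyo (Komaba II Campus)
Building E / west side / 5th floor, Conference Rooms A and B (Ew-501, 502)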
TEL 03(5452)6254/6256
http://www.iis.u-tokyo.ac.jp/map/index.html
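Odakyu Line: 7 minutes from Higashi-Kitazawa (the nearest station)
Odakyu Line: 12 minutes from Yoyogi-Uehara
Inokashira Line: 10 minutes from Komaba-Todaimae
If you are coming from Komaba-Todaimae Station, please enter through the East Gate.
Admission: free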
----------------------------------------------------------
Title: Clustering Web Documents
Speaker: Prof. Mukesh Mohania (IBM India)
Abstract:
Users increasingly rely on search engines to obtain useful
information from the web. Because a search returns a large number of
documents, it is becoming more and more difficult for users to find
relevant information. Hence, to make search more effective, it is
necessary to categorize documents into sets (i.e., clusters) based on
subject or similarity. This talk explores a way to cluster documents
based on the relative similarity between them. The documents are
scanned, and important keywords or document representatives are
extracted from each document. Weights are assigned to these keywords
based on their location in the document, their frequency, and various
other factors. We will then discuss the Row-Column Iterative
Algorithm, which is applied to the set of N documents to form
clusters based on the relative similarity of the documents. We will
also discuss some ongoing research projects.
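
The weighting and similarity steps described in the abstract can be
illustrated with a minimal Python sketch. It is not the Row-Column
Iterative Algorithm, whose details are not given here; as a stand-in it
uses cosine similarity and a simple greedy threshold grouping. The
function names, the title boost of 3.0, the threshold, and the sample
documents are all assumptions made for illustration.

```python
import math
import re
from collections import Counter

TITLE_BOOST = 3.0  # assumed extra weight for a keyword appearing in the title


def keyword_weights(title, body):
    """Weight keywords by frequency, with a boost for their location (title)."""
    weights = Counter()
    for word in re.findall(r"[a-z]+", body.lower()):
        weights[word] += 1.0          # frequency in the body
    for word in re.findall(r"[a-z]+", title.lower()):
        weights[word] += TITLE_BOOST  # location-based boost
    return weights


def cosine(a, b):
    """Cosine similarity between two keyword-weight vectors."""
    dot = sum(w * b[k] for k, w in a.items() if k in b)
    norm = math.sqrt(sum(w * w for w in a.values())) * \
        math.sqrt(sum(w * w for w in b.values()))
    return dot / norm if norm else 0.0


def greedy_clusters(vectors, threshold=0.5):
    """Stand-in grouping: join the first cluster whose first member is similar enough."""
    clusters = []  # each cluster is a list of document indices
    for i, vec in enumerate(vectors):
        for cluster in clusters:
            if cosine(vec, vectors[cluster[0]]) >= threshold:
                cluster.append(i)
                break
        else:
            clusters.append([i])
    return clusters


# Hypothetical sample documents as (title, body) pairs.
docs = [
    ("Database indexing", "btree index speeds database queries"),
    ("Database queries", "query planner uses index statistics"),
    ("Soccer results", "team wins soccer match"),
]
vectors = [keyword_weights(title, body) for title, body in docs]
clusters = greedy_clusters(vectors, threshold=0.3)
print(clusters)
```

With these sample documents, the two database documents share heavily
weighted title keywords and fall into one group, while the soccer
document shares no keywords with them and forms its own group.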