Mining ideas from Google Book Search
via read20-l
Bill Schilit's Forum talk on the work at Google on using quotations to establish hyperlinks between scanned books, and going beyond that to "idea mining", is now available.
Abstract: http://www.parc.xerox.com/cms/
get_article.php?id=772 WMV: mms://216.93.180.194/parc_
forum/v1232.wmv
the abstract
Scanning books, magazines, and newspapers is widespread because people believe a great deal of the world's information still resides off-line. In general, after works are scanned they are indexed for search and processed to add links. In this talk I will describe a new approach to automatically add links by mining repeated passages. This technique connects elements that are semantically rich, so strong relations are made. Moreover, link targets point within rather than to the entire work, facilitating navigation. Our system has been run on a digital library of over 1 million books (Google Book Search), has been used by thousands of people, and has generated the world's largest collection of quotations. I will also present a follow-on project based on the theory that authors copy passages from book to book because these quotations capture an idea particularly well: Jefferson on liberty; Stanton on women’s rights; and Gibson on cyberpunk. These projects suggest that mining quotations for links and ideas are an important mechanism for understanding the knowledge contained in books.
the paper citation
Generating Links by Mining Quotations.
Okan Kolak and Bill N. Schilit.
To appear in Proceedings of the nineteenth ACM conference on Hypertext
and Hypermedia (Pittsburgh, Pennsylvania, United States, June 19-21,
2008). HYPERTEXT '08. ACM, New York, NY.
PDF
(thanks to Bill Janssen for the announcement text, which I copied exactly because it captured an idea particularly well)
idea mining sounds brilliant!
after all - effective and meaningful communication reflects symmetrical thoughts.
very interesting how authors copy meaningful passages.
good artists copy - great artists steal. :P
effective communication is speaking as simple as possible - but not any simpler.
the essence of language is understanding.
understanding brings love/wisdom.
misunderstanding brings hate/ignorance.
love brings happiness.
cherish curiosity.
seek wisdom and you will be happy.
at the root of every evil is ignorance.
:)
keep up the amazing work!
Posted by: leaves | 01 November 2008 at 07:43 PM