计算之智与哲学之慧分享 http://blog.sciencenet.cn/u/huangfuqiang

博文

信息检索(IR)领域的杰出干将Doug Cutting

已有 4534 次阅读 2011-4-29 21:30 |个人分类:信息检索与搜索引擎|系统分类:海外观察| Lucene, Doug

Lucene大名鼎鼎,以下文本中有的链接打不开,可以自行检索。



Doug Cutting has been working in the field of information retrieval for over fifteen years.

Beginning in 1988, he spent five years at Xerox's Palo Alto Research Center (PARC) developing novel approaches to information access. These included a high-performance retrieval engine, several innovative search paradigms, advanced linguistic analysis methods, and high-quality text summarization algorithms. This work resulted in seven publications and six issued patents. Some of these technologies are now marketed by Inxight.

In 1993 he moved to Apple's Advanced Technology Group (ATG). There he developed a state-of-the-art retrieval engine code-named V-Twin. This engine was to be a part of the Copland operating system, automatically indexing the content of all files as they are created so that the the entire file system could be efficiently searched at any time. Copland was cancelled, but V-Twin has been used in several other Apple products.

In April of 1996, Doug left Apple and joined Excite where he took over development of the core search technology. This included growing Excite's web index from two million to fifty million pages; substantially optimizing Excite's search performance; adding phrase-searching capabilities; and creating a thesaurus-like feature which suggests related terms to add to queries.

In the fall of 1997 he reduced his commitment at Excite to part-time so that he could write Lucene, an efficient, full-featured text search engine written in Java. In early 1998 he returned to Excite full-time for two more years. Lucene sat on the shelf for much of that time, and was made open-source in the spring of 2000.

Doug now works as chief architect and president of Nutch, a nascent effort to implement an open-source web search engine, which aims to provide a transparent alternative to commercial web search engines. The specific purposes for which this corporation is organized are scientific and educational in nature: namely, to promote public access to search technology without commercial bias by:
* Providing free high-quality search software and its source code to the public; and
* Facilitating ongoing research and development of search technology in a public forum.

Doug also serves on Nutch's board of directors, together with Mitch Kapor, Tim O'Reilly, Peter Savich ( Overture Research), Raymie Stata (UCSC), and Graham Spencer ( Digital Consumer).
信息来源:http://www.wizards-of-os.org/archiv/sprecher/a_c/doug_cutting.html


https://blog.sciencenet.cn/blog-89075-438781.html

上一篇:"繁星(Stars)"蠕虫病毒撒向伊朗
下一篇:软件基础研究与基础软件110504
收藏 IP: 123.235.155.*| 热度|

1 许培扬

该博文允许注册用户评论 请点击登录 评论 (2 个评论)

数据加载中...
扫一扫,分享此博文

Archiver|手机版|科学网 ( 京ICP备07017567号-12 )

GMT+8, 2024-11-22 17:46

Powered by ScienceNet.cn

Copyright © 2007- 中国科学报社

返回顶部