The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software. The project is best known for its Indri search engine, Lemur Toolbar, and ClueWeb09 dataset. Our software and datasets are used widely in scientific and research applications, as well as in some commercial applications.
The Lemur Project's software development philosophy emphasizes state-of-the-art accuracy, flexibility, and efficiency. For example, the Indri search engine provides accurate search for large text collections 'out of the box', and data is stored in an accessible manner to support development of new retrieval strategies. Software from the Lemur Project is distributed under open-source licenses that provide flexibility to scientists and software developers.
L. Teyou, C. Demir, and A. Ngomo. Proceedings of the 27TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, Santiago de Compostela, Spain, ECAI24, (October 2024)
L. Teyou, C. Demir, and A. Ngomo. Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, Boise, Idaho, USA, CIKM24, (October 2024)
N. Kouagou, S. Heindorf, C. Demir, and A. Ngonga Ngomo. Proceedings of the Thirty-Third International Joint Conference on
Artificial Intelligence, IJCAI-24, International Joint Conferences on Artificial Intelligence Organization, (2024)Main Track.