YouSeer is an open source search engine framework, which was built on top of other open source components. It’s part of the general SeerSuite framework.  YouSeer utilizes Heritrix as a crawler and Solr as an indexing system. The framework provides software to ingest the documents harvested by Heritrix into Solr. The ingesting software is very flexible and allows for user-specific data extraction implementations.  Further, YouSeer provides a simple interface to query the index and another interface to retrieve cached versions of the documents.

 

What is YouSeer?

YouSeer is available for download on SourceForge.

Download

This tutorial will guide you through your first job on YouSeer.

Getting Started

The documentation is available here.

Documentation