OmegaSoft Homepage  |  Products & Services  |  About OmegaSoft


 

OmegaSoft PathSearch Technology Project

The OmegaSoft PathSearch Technology Project is an on going research group looking into web data mining and information retrieval.  Current focuses include hypertext searching and web citation, this has lead to the development of the PathSearch search engine.

News & Press

All press articles, news and blog entries about the PathSearch search engine can be found here.

History

The OmegaSoft PathSearch Technology Project was setup in July 2004, following the development OmegaSoft's web search engine, PathSearch.  The group have continued to maintain, refine and develop innovating search technologies.

  • June 2004, 'Leon' search engine developed.  This crude search tool allowed for minimal crawling and indexing of the web and simple (single word) search queries.
     
  • July 2004, OmegaSoft PathSearch (codenamed Milestone One) developed and placed into beta testing.  This engine had an increased and more robust database, search crawlers and could support multi-word queries.  It contained a database of approximately 5,000 searchable websites.
     
  • February 2006, OmegaSoft PathSearch M2 (codenamed Milestone Two) launched.  This engine was a complete rewrite, using relational databases and a bank of robust crawlers.  Search times were improved and the index increased up to 36,000 searchable websites and 'blogs'.
     
  • Currently developing the M3 engine (codenamed Milestone Three).  Specific details have yet to be released

PathSearch Tuning Team

In addition to the main research and development team, there is also a tuning team which concentrate on maintaining and fine-tuning the search engine post launch.

Work includes database optimisation to speed up search times, crawler development, user support and interim development on improved coding / algorithms.

Third Party Research

Third party (private or academic) research groups can also access PathSearch's complex data structures to conduct research on the data held within.  Common research topics include;

  • User search trends
  • Web citation trends
  • HTML analysis
  • Physical web structure

Access to the data is made available from a secure online 'Spacelab' portal, which allows researchers to run queries on the database and display trends.

If you would like access to this large repository of raw data, please send all enquiries to pathsearch@omegasoft.co.uk.  We will ask for some background information, such as, what will you be using this data for?  Commercial interest is strictly prohibited.  Personal data relating to users is also restricted.  Personal data is anonymised before use, protecting individual identities.

Links

Contact

Please email pathsearch@omegasoft.co.uk with comments and suggestions



© Copyright 2008  |  Terms of Use  |  OmegaSoft Homepage  |  Feedback