Links:
ASPSeek a full-featured medium-to-large scale SQL-based Internet search engine. It consists of indexing robot, search daemon and search frontend (CGI program) Da'ath (commercial) Da'ath is an organized, easy to use, in-depth search tool using MySQL full-text searching. This allows searching for error messages, select words, phrases, or null searches, to produce a relevant search result from searching all fields of importance. DataparkSearch a full-featured web search engine released under the GNU General Public License. DataparkSearch consists of two parts. The first part is indexing mechanism (indexer). Indexer walks over html hypertext references and stores found words and new references into database. The second part is web CGI front-end to provide search using data collected by indexer ddc-concordance ddc-concordance is a search engine developed specially to meet the needs of linguistic researchers. DejaSearch a frontend to Deja.com, the leading Usenet archive and search engine Douglas Thrift's Search Engine an indexing search engine for use on small websites such as personal or small business sites. It is designed to be very similar to Google for end users and its output is customizable. For indexing, it supports both the Robots Exclusion Protocol and the Robots META Tag DreamCloak allows you to create unlimited cloaked entry pages for each actual page across unlimited sites DuckDuckGo DuckDuckGo is a search engine focused on relevant results and respecting user privacy. It is a mash-up of several other sites like Wikipedia, About, Bing, and Yahoo. Estraier a full-text search system for personal use. Full-text search means functions to search lots of documents for some documents including specified words filofant filofant is an indexing server for e-mails, attachments and other documents stored on various locations in your company. The indexed documents are accessible by a customizable web frontend like an internet search engine. FM SiteSearch Pro (commercial) FM SiteSearch Pro adds a search capability to a web site. It comes with a relevance engine, control panel, large web site support, mysql support (optional), search/keyword statistics, advanced searches, specialized searches, fully customizable and many more. focuseek searchbox can spider sites and power the search function of a web site or portal, or it can index information from any source and enable search in your business processes FtpLocate a fast FTP search engine written with Perl gonzui gonzui is a source code search engine for accelerating open source software development. In the open source software development, programmers frequently refer to source codes written by others. Our goal is to help programmers develop programs effectively by creating a source code search engine that covers vast quantities of open source codes available on the Internet. Goose Search allows you to search Google's index of the Internet from the command line. You run Goose, giving it your list of search terms, and it presents a list of search results using an easy to navigate Curses display in your terminal. You can then select a search result to open in your web browser Harvest a full featured web based search system for any kind of documents HarvestMan a web-crawler written in the python programming language. HarvestMan is a full-featured, multithreaded web-crawler written in python. HarvestMan supports as much as 40 customization options as of the current stable version Heritrix the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project htdoogle a Web interface for the HTDIG search engine. It is fast and intuitive HtSearch a PHP interface to htsearch, a frontend to ht://Dig. Hyper Estraier Hyper Estraier is a full-text search system. It can be used as a Web search engine, mailbox searching, etc. It features high performance searching, high scalability of target documents, a perfect recall ratio by the N-gram method, phrase searching, attribute searching, and similarity searching. Multilingualism is supported with Unicode. imgSeekWeb imgSeekWeb is based on the imgSeek project. The final goal is a distributed server side content-based image search engine. IntelliSearch Intellisearch is a concept thought up to combine Dasher, an on screen predictive keyboard, with Yahoo!'s predictive search functionality, in an accessible manner. isbnsearch a distributed search portal of common sources of ISBN numbers, with permanent caching of results. To provide a open-source free interface for ISBN retrieval using HTML, SQL or XML to be independent of any toolkits or software KeywordXS KeywordXS is a small tool to get keywords for specified search terms. It will simply query a given search term and display the associated results. It uses the Internet so that all the keywords are top actual. locust locust is a full featured Internet search engine specifically designed for knowledge area or corporate search. It can index 2.5 million documents per 24 hours on a single Dell server. It consists of clean C++/STL code written from scratch. LuMriX a search engine that exploits XML and XML Topic Maps. In contrast to other retrieval methods, it does not relate single items to resources, but combines given items into meaningful associations (concepts), which are in turn linked to resources mguesser a standalong part of libudmsearch (a core of mnogo search engine http://mnogosearch.org) which allows to guess text's charset and language mnoGoSearch extension for PHP nmGoSearch extension for PHP is a complete PHP binding for the mnoGoSearch API. mnoGoSearch-php a full-featured web search engine software for intranet and internet servers Montezuma Montezuma is a full-text indexing/search engine library written entirely in Common Lisp. Montezuma is a Common Lisp port of Ferret. Ferret is a Ruby port of Lucene. mygosuMenu a simple, lightweight, fast, free, search engine friendly DHTML menu, compatible with most browsers Namazu a full-text search system intended for easy use. Not only it works as a CGI program for a small or medium scale Web search engine, but also works as a personal use search system for your pile of emails NVBase NVBase is an information retrieval system that makes any data within an enterprise available. Any source of information including emails, RDBMS, file systems, and Web pages can be indexed and searched. Pagecast a program that makes it easy to send lists of URLs to popular internet search engine services Perlfect Search a sophisticated, powerful, versatile, customizable and effective site indexing/searching suite available under an open source license (GPL). It comes as a pair of disctinct scripts. The indexer, that automatically scans and indexes a web site, and the search engine, a cgi script that serves search queries for keywords over the index, and displays results pages in html, in a standard format including title, description and relevance ranking for each matching document Personal Search Engine a tool that allows a webmaster or developer relatively easy add a local search engine to an existing web site PhpDig PhpDig is a web spider and search engine written in PHP, using a MySQL database and flat file support. PhpDig builds a glossary with words found in indexed pages. On a search query, it displays a result page containing the search keys, ranked by occurrence. phpLinks an open source project written in PHP for use with MySQL, allowing one to run an extremely efficient Link Farm with full search capabilities. A "simulated" search engine in many ways phpSERA a PHP/MySQL-based tool for Search Engine Ranking Analysis (SERA). The rankings are based on parsing output of search engines, using simple regular expressions POPsearch personal search engine that is designed to help you easily organize and find information on your computer. POPsearch lets you index your entire collection of email messages and files. This collection can then be searched from any web browser pro-search pro-search is a crawler for FTP servers, SMB shares, HTTP servers, and DC++ networks. It has a powerful, Web-based search and navigation interface. pseudo-cron allows users to use cron jobs on a Website without shell access. Whenever any user requests a page which uses pseudo-cron, it checks if any cron jobs should have been run since the previous request PyLucene a GCJ-compiled version of Java Lucene integrated with Python via SWIG. Its goal is to allow you to use Lucene's text indexing and searching capabilities from Python. It is designed to be API compatible with the latest version of Java Lucene Pyndex a simple full text indexer written in Python. It uses Metakit as its storage manager, so you need to have Metakit installed Quick Submit Quick Submit is an automatic search-engine URL submitter. It is a perl CGI which allows you to submit your website to search engines in a matter of minutes. Satellite2 a website indexing/search fascillity, written in Perl with (planned) : support for a large range of file formats (txt, html, doc, xsl, pdf, chm), Unlimited number of different indexes possible with one Satellite installation, Unlimited number of resulpage templates possible, and a web based administrator which allows easy adding, modifying and deleting of indexes SBC Links a very simple, light and compact PHP/MySQL Link/Download Indexing System that is very configurable and very easily adaptable to any Web site Sherlock Holmes a universal search engine a system for gathering and indexing of textual data (text files, web pages, ...), both locally and over the network Simple Python Distributed Indexing SPyDI Is a powerful engine to create distributed full text indexing systems and distributed search engines. It supports harvesting, crawling (pull mehtods), and push methods (via a Web interface or SPyRO Web services). It supports boolean and vector Information retrieval models. It has few dependencies, and comes with its own HTTP server and HTML embedded pages language (called pyew and wey pages), and session manager. It can use the SMTP of the Python library. It supports replacing the default modules with some better modules (Apache, exim, etc).
Next 50