DocFetcher
DocFetcher is a free and open source desktop search application. It runs on Windows, Mac OS X and Linux and is written in Java. The application has a graphical user interface, which is written using the Standard Widget Toolkits.
The program is an indexing search tool, meaning it has a local database of file content that it checks, rather than looking over all files on your machine. This means the program must always be running to monitor changes, but search results are instant. Search tools are based on Apache Lucene software, a widely used, open source search engine.
Features
- Unicode support
- Full text search for all major document file formats, including:
- *Office files
- *EPUB, PDF
- *RTF, SVG and any other plain text files
- *Audio metadata
- *Picture metadata
- *Archive formats. Also supports nested archive files
- *HTML with pair detection. Which means that DocFetcher detects when an HTML file and a folder containing the resource files of the page belong together.
- Possibility to automatically detect file changes and update the index accordingly
- Exclusion of files from indexing based on regular expressions
- A query language supporting boolean operators, wildcards, phrase search, fuzzy search and proximity search
- World languages: translations in Chinese, Italian, Ukrainian. Partly translated to French, Japanese, Spanish, and German.