Can't find a software application?
Submit it to OStatic
Click a filter below to apply it to results
The TEI is an international and interdisciplinary standard used by libraries, museums, publishers, and academics to represent all kinds of literary an...
mnoGoSearch (formerly known as UdmSearch) is a full-featured Web search engine that you can use to build search engines over HTTP, HTTPS, FTP, and NTT...
The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, text), and ...
Greenstone is a complete digital library creation, management, and distribution package created and distributed by the New Zealand Digital Library Pro...
Crawler and content extractor for building a full text index of a website's contents. Uses Ferret for indexing.
Rcrawl is a web crawler written in ruby.
Ruby/ZOOM provides a Ruby binding to the Z39.50 Object-Orientation Model (ZOOM), an abstract object-oriented programming interface to a subset of the ...
PHPX is a Web portal system, blog, Content Management System (CMS), forum, and more. It is designed to allow everyone to be able to have feature-rich,...
Sesame is a Java framework for storing, querying and inferencing for RDF. It can be deployed as a web server or used as a Java library. Features inclu...
Update: uformatparser is deprecated, please use scrAPI instead. It does microformats, and much more. See: http://rubyforge.org/projects/scrapi/