Can't find a software application?
Submit it to OStatic
Click a filter below to apply it to results
Crawler and content extractor for building a full text index of a website's contents. Uses Ferret for indexing.
Rcrawl is a web crawler written in ruby.
Ruby/ZOOM provides a Ruby binding to the Z39.50 Object-Orientation Model (ZOOM), an abstract object-oriented programming interface to a subset of the ...
PHPX is a Web portal system, blog, Content Management System (CMS), forum, and more. It is designed to allow everyone to be able to have feature-rich,...
Sesame is a Java framework for storing, querying and inferencing for RDF. It can be deployed as a web server or used as a Java library. Features inclu...
Update: uformatparser is deprecated, please use scrAPI instead. It does microformats, and much more. See: http://rubyforge.org/projects/scrapi/
scrAPI is an HTML scraping toolkit for Ruby. It uses CSS selectors to write easy, maintainable scraping rules to select, extract and store data from H...
ViewVC (formerly ViewCVS) is an open source tool for viewing the contents of CVS and SVN repositories using a web browser. It allows you to look at sp...
DCP-Portal is a content management system with advanced features like Web-based update, link, file, member management, poll, calendar, etc. Its main f...
libextractor is a library used to extract meta-data from files of arbitrary type. It is designed to use helper-libraries to perform the actual extract...