Home
Welcome to the Infinite Monkeywrench (IMW) wiki!
The Infinite Monkeywrench is a collection of tools to download, clean, process, and package datasets from a variety of sources (HTML, RSS, XML, CSV, etc.) into a variety of formats (XML, CSV, Excel, JSON, SQL, YAML, etc.). To use IMW, you write a YAML file that describes the workflow for processing the data, then pass that file to the imw command-line program.
IMW can be used by individuals, research groups, businesses, or institutions to organize and aggregate the data they rely on, from both external and internal sources, into a common conceptual and technological space. It takes the drudgery out of extract, transform, load (ETL) work so you can spend your time using your data.
IMW is also the backend that powers Infochimps, a data aggregation website that aims to create the world's best repository for raw data. The better IMW gets, the easier it will be to help Infochimps evolve into what it should be.
