[CivicAccess-discuss] Mixing Up Great Data Without the Hassle of Scraping | YDN Blog - Yahoo!

Wed May 8 03:27:53 AEST 2013

Interesting, though it only moves the issue. It's not a solution; it just creates a different problem. :) But still… interesting.

On Tue, 07 May 2013 17:24:37 GMT
In Mixing Up Great Data Without the Hassle of Scraping | YDN Blog - Yahoo!
At http://developer.yahoo.com/blogs/ydn/mixing-great-data-without-hassle-scraping-065032245.html

> Import.io solves the problem of obtaining data
> from multiple sources on the web by providing APIs 
> which query and extract data from websites into 
> consistent JSON data formats.
> 
> The import.io extractor tool allows developers to 
> train our platform to recognise semi-structured 
> information in web pages so that it can provide 
> information extraction from web pages on demand 
> over REST APIs. The information is extracted from 
> the pages and returned in a structured JSON 
> document which adheres to the schema that you 
> define when you create the extractor.
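
To make the quoted description a bit more concrete, here is a minimal Python sketch of what returning data that "adheres to the schema that you define" could look like. It does not use import.io's actual API, which the quote does not detail; SCHEMA, conform, and the sample records are all hypothetical, and no network access is involved.

```python
import json

# Hypothetical schema: field name -> type each value is coerced to.
SCHEMA = {"title": str, "price": float, "stock": int}


def conform(raw, schema=SCHEMA):
    """Return a dict holding exactly the schema's fields.

    Extra fields are dropped; missing or uncoercible values become None.
    """
    out = {}
    for field, typ in schema.items():
        value = raw.get(field)
        try:
            out[field] = None if value is None else typ(value)
        except (TypeError, ValueError):
            out[field] = None
    return out


# Two made-up records, shaped as if scraped from two different sites.
rows = [
    {"title": "Widget", "price": "12.50", "stock": "3", "colour": "red"},
    {"title": "Gadget", "price": 7},
]
print(json.dumps([conform(r) for r in rows], indent=2))
```

Whatever the source page looked like, every record comes out with the same three fields, which is the "consistent JSON" part of the claim. The hard part, recognising those fields in the page in the first place, is not shown here.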

-- 
Karl Dubost
http://www.la-grange.net/karl/