The site’s users can also set certain answers to “private,” which makes the responses inaccessible to others.
In this case, the researchers scraped and presented the data accessible through Google and Q&A responses from individual profiles.
Some questioned whether such data harvesting, bundling, and broadcasting is justifiable for academic research and whether it crosses ethical and legal lines.
Although the researchers did not release the real names and pictures of the Ok Cupid users, critics noted that their identities could easily be uncovered from the details provided—such as from the usernames.
) He also argued that retaining the information in the dataset would allow certain missing details—like height, profile text, or photos—to be added later.
The data, collected from November 2014 to March 2015, is indeed public—sort of.
“If the journal does not take the paper, we will probably publish it elsewhere,” he said.
Ok Cupid, owned by Inter Activ Corp’s (iac) Match Group (mtch), released a statement that complained about the published data.
He added that it would be easy to identify more than 10,000 of the people in the data dump and link them to their sexual inclinations.Our data extraction software can automatically walk through whole web sites and collect complete content structures such as product catalogs or search results.Download Visual Web Ripper trial Install our web scraper software now and start extracting data from the web today.With your free project and the help of our dedicated support staff, you will be able to start creating your own projects in no time.Content Grabber is tailored to corporations who need improved performance and reliability along with management facilities for easily monitoring multiple web scraping agents.