Google Makes Dataset Discovery Easier
Written by Kay Ewbank   
Friday, 14 September 2018

Google has launched a customized search aimed at 'scientists, data journalists, and data geeks' who need to find datasets no matter where they're hosted.

googledataset search

The aim of the search is to let people find the data they need from the many data repositories on the web. The tool works in a similar way to Google Scholar, which can be used to search academic papers for data.

Dataset Search in part relies on the creators or providers of the dataset making metadata available for the search, such as who created the dataset, when it was published, a citation describing the dataset, summary keywords, and spatial coverage. These metatags are indexed by Dataset Search and combined with input from Google’s Knowledge Graph, which is what shows as an infobox next to search results to make the results more useful. Google collects and links this information, analyzes where different versions of the same dataset might be, and finds publications that may be describing or discussing the dataset.

The current version of Google Dataset Search has references to most datasets in environmental and social sciences, as well as data from other disciplines including government data and data provided by news organizations.

The developers say that as more data repositories use the schema.org standard to describe their datasets, the variety and coverage of datasets that users will find in Dataset Search will continue to grow. As Google acknowledges, the success of DataSet Search will depend on organizations choosing to add the metadata tags to their material to make it accessible to the indexing process, but given the power of Google, it's unlikely that any organization making data available on the web will ignore this requirement.

Dataset Search works in multiple languages, and support for additional languages is 'coming soon'.

google

More Information

Dataset Search

Related Articles

Google Uses Search Data To Predict Box Office Hits

The Allen Institute's Semantic Scholar 

RankBrain - AI Comes To Google Search

Allen Institute Asks "Can You Make An AI Smarter Than An 8th Grader"

Fuzzy Logic And Uncertainty In AI

Find Prior Art Added to Google Patent Search 

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on Twitter, Facebook or Linkedin.

Banner


ZLUDA Ports CUDA Applications To AMD GPUs
18/04/2024

ZLUDA is a translation layer that lets you run unmodified CUDA applications with near-native performance on AMD GPUs. But it is walking a fine line with regards to legality.



Pure Virtual C++ 2024 Sessions Announced
19/04/2024

Microsoft has announced the sessions for Pure Virtual C++ 2024, which is taking place on April 30th 15:00 UTC. People who sign up will get access to five sessions happening on the day, alongside a ran [ ... ]


More News

raspberry pi books

 

Comments




or email your comment to: comments@i-programmer.info