Google Dataset Search Engine
Introduction
Gathering and organizing as much information as possible, and easing the access for everybody around the world, has always been Google’s goal[1].
This goal’s first target was the commercial web, yet, the company has recently changed its usual target, and has shifted a bit towards the scientific community[1].
To better serve scientists and researchers, on September 5th, 2018, Google has launched a service called Dataset Search, which is a search engine just for finding datasets on the internet[2][1]. This search engine will act as an additional search tool for scientific communities, along with the company’s Google Scholar, which is the company’s most popular search engine for academic studies and reports[2].
Google Dataset Search Engine
Researchers have usually relied on sources like: the World Bank, NASA, ProPublica, IEA, IRENA or other search engines, but this new tool should make their work much easier, as it is dedicated to help journalists and researchers to easily reach the available public data[3].
Google Dataset Search will allow its users to search through enormous amount of datasets across tens of thousands of repositories on the web, whether they exist on a governmental database, public sources, publisher’s site, a digital library or an author’s personal web page[2][3]. It applies the usual Google’s approach and design for search, but to datasets published across the web[3]. For an example: the searcher can use a typical research words “energy access” to look for energy access trends in general, or “energy access:IRENA” for retrieving results about energy access from the International Renewable Energy Agency in particular, as the engine supports shortcuts that works on Google regular search tool[3].
The initial Dataset Search covers: environmental & social sciences, governmental data and datasets from news organizations, and is expected to expand to include more sources shortly[2][1].
The tool already supports multiple languages, and will be able to support more languages soon, and it is also expected to be used by scientists, journalists, data geeks, and basically anyone who is using data behind a specific topic[3][4].
Commentary
Here are couple of comments from people either working on the project or working with it:
- “The aim is to unify the tens of thousands of different repositories for datasets online. We want to make the data discoverable, but keep it where it is.” Natasha Noy, a research scientist at Google AU who helped create Dataset Search[2].
- “This type of search has long been the dream for many researchers in the open data and science communities.” Ed Kearns, chief data office at NOAA[2].
- ↑ 1.0 1.1 1.2 1.3 Vincent, J. (2018). Google Launches New Search Engine to Help Scientists Find the Datasets They Need: Dataset Search Could Be a Scientist’s Best Friend. Retrieved From: https://www.theverge.com/2018/9/5/17822562/google-dataset-search-service-scholar-scientific-journal-open-data-access
- ↑ 2.0 2.1 2.2 2.3 2.4 2.5 Lahoti, S. (2018). Google Launches a Dataset Search Engine for Finding Datasets on the Internet. Retrieved From: https://hub.packtpub.com/google-launches-a-dataset-search-engine-for-finding-datasets/
- ↑ 3.0 3.1 3.2 3.3 3.4 Mehta, I. (2018). Google’s New Search Engine Reveals Public Datasets for Research and Journalism Projects. Retrieved From: https://thenextweb.com/google/2018/09/06/google-launches-dataset-search/
- ↑ Schwarz, B. (2018). Google Dataset Search: A New Search Service to Find Data from Sciences, Government, Some News Organizations – Using Dataset Schema, You Can Mark Up Your Data to get it Included in the New Google Dataset Search Engine. Retrieved From: https://searchengineland.com/google-dataset-search-a-new-search-service-to-find-data-from-sciences-government-some-news-organizations-304968