Giant Index Comprising Millions of Research Papers Released Online for Free

The General Index, created by American archivist Carl Malamud, was released earlier this month.

Highlights
  • The General Index has been developed by archivist Carl Malamud
  • The index covers more than 107 million journal articles
  • The General Index is free for all

With so much research published every day all over the world, a powerful search tool has become essential for sifting through the seemingly endless volume of academic papers. Faced with this challenge, a technologist has found a way to unlock the world's research papers for easier computerised analysis. He has released an online index of some 107.2 million journal articles, including many paywalled research papers, totalling 38TB of data in its uncompressed form.

The General Index, created by American archivist Carl Malamud, was released on October 7 and is free to use. The index holds over 355 billion words and sentence fragments, each listed next to the articles in which it appears. “It is an effort to help scientists use software to glean insights from published work even if they have no legal access to the underlying papers,” Malamud told the journal Nature.

The primary objective of the index is to help with text mining, the process of using computers to quickly scan millions of data points for references to something specific. Humans cannot possibly read millions of journal articles, but a computer programme connected to the General Index can search them.

Researchers who have had early access to the index called it a major development. Gitanjali Yadav, a computational biologist at the University of Cambridge, UK, who studies volatile organic compounds emitted by plants, said the index will help researchers reach many papers that already exist but were previously hard to find. Until now, researchers were restricted to mining only open-access papers or those available through their subscriptions, a limit the index is meant to ease.

Malamud said his index contains only snippets up to five words long, so releasing it does not breach publishers' copyright restrictions.

© Copyright Red Pixels Ventures Limited 2021. All rights reserved.