Skip to Main Content

Search for Text Data Sets: Newspapers & Media

Accessing Newspaper & Periodicals (Magazine) Datasets

We work on a case-by-case basis to purchase datasets for researchers, with higher priority going to data with permanent rights and multi-user licenses. Please allow for time to arrange access. Request a data purchase from the databases below.

ProQuest Databases

Full-text Data Mining

For ProQuest databases, archival rights and hard drives must be purchased from ProQuest.

Currently Available Drives

The library provides access to the hard drives by appointment. Contact your liaison or the Scholarship and Research Initiative to arrange access.

Citation & Abstract Mining

ProQuest My Research account

Using the Results Export tool, you can export up to 10,000 results two times a day. Results include citation information, as well as the abstract. You will need to make a ProQuest My Research Account.

  • Note: Anything beyond this is considered harvesting or programmatic downloading, which is not allowed under our license. Please contact us with requests for full-text downloading.

Gale Databases

Gale provides newspaper and magazine archives for text mining. For Gale databases, archival rights and hard drives must be purchased from Gale.

Readex

Readex provides in database access to the text analysis tool Voyant in the below  databases. Look for the Text Explorer link, near the Advanced Search option in the database.

Accessible Archives

Accessible Archives provides access to text data from the African American Newspaper Collections in XML files. 

Newsbank

Newsbank provides access to text data from their collections (for a fee), based on scope.

Newspapers from Digital Libraries

Digital libraries that offer access to digitized versions of newspapers through APIs.

Newspapers Direct from Publisher