Published June 22, 2019 | Version v1
Other Open

Collection 5: U.S. Top Newspapers, 1977-2018 (articles mentioning "humanities" or "liberal arts")

Description

A collection of word-frequency and other data representing 30,323 unique articles mentioning "humanities" or "liberal arts" (no duplicate or close-variant documents) published from 1977 to 2018 in the 15 top-circulation U.S. news sources and their associated blogs. The word "humanities" occurs 39,890 times in 28,398 documents in the collection, while the phrase "liberal arts" occurs 2,888 times in 2,380 documents. WE1S and other researchers use this data to look for broad patterns and to help guide closer study.

The sources (ordered by number of articles in descending order) are: New York Times, Washington Post, Los Angeles Times, Chicago Tribune, News Day, Boston Globe, Dallas Morning News, Star Tribune Minneapolis, Houston Chronicle, Daily News, Seattle Times, Denver Post, USA Today, Tampa Bay Times, New York Post. (Full sources and counts are available as a csv file in the Collection Dataset.)

Collection Metadata

  • Created by: Lindsay Thomas
  • Created on: June 22th 2019, 12:00:00 am
  • WE1S Collection Registry ID: 20190622_2208_us-humanities-libarts-top-newspapers
  • Data sources: LexisNexis (via LN Web Services Kit), ProQuest, and direct scraping from the Web.

Suggested Citation for Collection

WhatEvery1Says (WE1S) Project. (2019, June 06). Collection 5: U.S. Top Newspapers, 1977-2018 (articles mentioning "humanities" or "liberal arts"). doi: http://10.5281/zenodo.4914736.

Notes

WE1S makes available only "non-consumptive use" word frequency, topic model, and other datasets along with their visualizations. Datasets cannot be used to access, read, or reconstruct the original texts.

(See WE1S Research Materials Overview for the relation between the project's "datasets" and "collections.")

Files

Files (275.8 MB)

Name Size Download all
md5:208e18cb6484503c39a6c55e0a693cbe
275.8 MB Download