The presentation provided an overview of the HathiTrust Research Center (HTRC) and its services. HTRC provides access to over 13 million digitized book volumes and facilitates text mining and analysis through its extracted features dataset, data capsule, and other tools. It discussed challenges of text mining copyrighted works and demonstrated use cases using distant reading techniques. HTRC also works on outreach, education, and developing new interfaces and tools to enable scholarly research using its collections and infrastructure.
Related topics: