Internet ArchiveDigitized print resources and born-digital content, with a special focus on web pages and digitized books. Use wget to download files from their site in bulk. Data delivered in XML or JSON format. No download limit, but they recommend downloading only 10,000 items per query to prevent errors.