OpenAlex offers multiple ways to access data beyond the REST API. Choose the approach that best fits your use case.Documentation Index
Fetch the complete documentation index at: https://developers.openalex.org/llms.txt
Use this file to discover all available pages before exploring further.
When to use the API vs. downloads
Use the REST API
- Quick lookups and searches
- Building applications
- Real-time data needs
- Most use cases
Use data downloads
- Large-scale analysis
- Machine learning training
- Building local search indexes
- Offline access requirements
Download options
OpenAlex Snapshot
The complete OpenAlex database as gzip-compressed JSON Lines files. The free public snapshot is updated quarterly; paid plans get monthly snapshots and daily change files. Includes works, authors, sources, institutions, topics, publishers, funders, and more. Best for: Full database replication, data warehousing, comprehensive analysis Size: ~330 GB compressed, ~1.6 TB decompressed Learn more about the snapshot formatOpenAlex CLI
The official command-line tool for downloading filtered subsets of OpenAlex data.Full-text PDFs
Download PDFs and TEI XML for about 60 million works. Requires an API key — content downloads cost $0.01 per file. Best for: Text mining, content analysis, building corpora Full-text PDF documentationDecision tree
Getting started
- For the snapshot: Follow the download instructions to get the data to your machine
- For the CLI: Install with
pip install openalex-officialand runopenalex download --help - For PDFs: See full-text PDFs for the three download options