@openpets/arxiv
v1.0.0
Published
Search, discover, and retrieve metadata for academic papers on arXiv.org. Supports full-text search across titles, authors, abstracts, and categories with date filtering, sorting, and pagination.
Downloads
74
Maintainers
Readme
arXiv
Search and retrieve academic papers from arXiv.org, the open-access repository of electronic preprints for physics, mathematics, computer science, and more.
Setup
No API key is required. The arXiv API is completely free and public.
bun installOptionally configure a default max results limit:
cp .env.example .env
# Edit .env to set ARXIV_MAX_RESULTS (default: 10)Tools
| Tool | Description |
|------|-------------|
| arxiv-test-connection | Verify the arXiv API is reachable |
| arxiv-search | Search papers by query, author, title, abstract, category, date range |
| arxiv-get-paper | Get full metadata for a single paper by arXiv ID |
| arxiv-get-papers | Batch retrieve metadata for multiple papers by ID |
Search Fields
The arxiv-search tool supports targeted field searches:
| Parameter | arXiv Field | Example |
|-----------|-------------|---------|
| query | All fields | "transformer architecture" |
| title | Title only | "attention is all you need" |
| author | Author name | "Vaswani" |
| abstract | Abstract only | "self-attention mechanism" |
| category | arXiv category | "cs.CL" |
| journalRef | Journal reference | "Nature" |
arXiv Categories
Common categories:
cs.AI- Artificial Intelligencecs.CL- Computation and Language (NLP)cs.CV- Computer Visioncs.LG- Machine Learningcs.CR- Cryptography and Securitymath.CO- Combinatoricsphysics.hep-th- High Energy Physics (Theory)stat.ML- Machine Learning (Statistics)
Full taxonomy: https://arxiv.org/category_taxonomy
Example Queries
search arxiv for transformer architecture papers
find recent papers by Yann LeCun
search arxiv for papers about large language models in cs.CL
get arxiv paper 2301.07041
search arxiv for quantum computing papers submitted after 20240101Rate Limits
The arXiv API recommends a 3-second delay between requests. This plugin enforces that automatically.
Notes
- The arXiv API returns Atom 1.0 XML; this plugin parses it into structured JSON
- Results are cached daily by arXiv (no updates until next midnight UTC)
- Maximum 100 results per request, 30,000 total via pagination
- For bulk data access, see https://info.arxiv.org/help/bulk_data.html
