The arXiver dataset, recently released on Hugging Face by the neuralwork group, contains 138,830 arXiv papers converted to multi-markdown (.mmd) format. Each record includes the paper's original arXiv ID, title, abstract, authors, publication date, and URL, along with the corresponding markdown text, making it a practical resource for researchers and developers working with scientific literature.
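
To make the record layout concrete, here is a minimal sketch of how the dataset might be streamed and inspected with the Hugging Face `datasets` library. The repository id `neuralwork/arxiver` and the column names in `FIELDS` are assumptions inferred from the description above, not values confirmed here, so check them against the dataset card before relying on them. The helpers `stream_arxiver` and `summarize` are hypothetical names introduced only for illustration.

```python
from typing import Any, Dict, Iterator

# Assumed repository id and column names; verify both on the dataset card.
DATASET_ID = "neuralwork/arxiver"
FIELDS = ("id", "title", "abstract", "authors", "published_date", "link", "markdown")


def stream_arxiver(split: str = "train") -> Iterator[Dict[str, Any]]:
    """Yield records one at a time instead of downloading the whole dataset."""
    # Imported lazily so summarize() below works without the `datasets` package.
    from datasets import load_dataset

    yield from load_dataset(DATASET_ID, split=split, streaming=True)


def summarize(record: Dict[str, Any]) -> str:
    """Build a one-line summary; raise KeyError if an expected field is absent."""
    missing = [name for name in FIELDS if name not in record]
    if missing:
        raise KeyError(f"record lacks expected fields: {missing}")
    # Whitespace-separated token count as a rough size measure of the paper body.
    words = len(record["markdown"].split())
    return f'{record["id"]} | {record["title"]} | {words} words of markdown'
```

With network access and `datasets` installed, `summarize(next(stream_arxiver()))` would print a summary of the first record. Streaming is used here so that a quick inspection does not require fetching every markdown file up front.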