User Guide

The audioarxiv command-line tool allows you to fetch research papers from arXiv and read them aloud using a text-to-speech engine.

Basic Usage

$ audioarxiv --id 1602.03837

This will fetch the arXiv paper with ID 1602.03837, process its content, and read it aloud.

To save the audio to a file instead:

$ audioarxiv --id 1602.03837 --output audio.mp3

Reminder: It takes a long time to process a paper into an audio file.

Command-Line Options

Command-line Options for audioarxiv
Option	Description
`-h`, `--help`	Show the help message and exit.
`--id ID`	arXiv paper ID.
`--output OUTPUT`	Output to audio file if provided.
`--rate RATE`	Number of words per minute between 50 and 500.
`--volume VOLUME`	Volume between 0 and 1.
`--voice VOICE`	Voice to use for text-to-speech.
`--pause-seconds PAUSE_SECONDS`	Duration of pause between sentences in seconds.
`--page-size PAGE_SIZE`	Maximum number of results fetched in a single API request.
`--delay-seconds DELAY_SECONDS`	Number of seconds to wait between API requests.
`--num-retries NUM_RETRIES`	Number of times to retry a failing API request before raising an exception.
`--list-voices`	List the available voices.

List Available Voices

To explore which voices are supported by your system:

$ audioarxiv --list-voices

Configuration File

All command-line options can be saved in a configuration file for reuse. When the tool is run, it generates or loads settings from a config file stored locally. The path to this file is printed in the logs when you run the tool with --id.

This allows you to define and reuse your preferred settings (e.g. voice, volume, rate) without needing to type them every time.