User Guide
The audioarxiv command-line tool allows you to fetch research papers from arXiv and read them aloud using a text-to-speech engine.
Basic Usage
$ audioarxiv --id 1602.03837
This will fetch the arXiv paper with ID 1602.03837, process its content, and read it aloud.
To save the audio to a file instead:
$ audioarxiv --id 1602.03837 --output audio.mp3
Reminder: It takes a long time to process a paper into an audio file.
Command-Line Options
Option |
Description |
---|---|
|
Show the help message and exit. |
|
arXiv paper ID. |
|
Output to audio file if provided. |
|
Number of words per minute between 50 and 500. |
|
Volume between 0 and 1. |
|
Voice to use for text-to-speech. |
|
Duration of pause between sentences in seconds. |
|
Maximum number of results fetched in a single API request. |
|
Number of seconds to wait between API requests. |
|
Number of times to retry a failing API request before raising an exception. |
|
List the available voices. |
List Available Voices
To explore which voices are supported by your system:
$ audioarxiv --list-voices
Configuration File
All command-line options can be saved in a configuration file for reuse.
When the tool is run, it generates or loads settings from a config file stored locally.
The path to this file is printed in the logs when you run the tool with --id
.
This allows you to define and reuse your preferred settings (e.g. voice, volume, rate) without needing to type them every time.