@Honcia13: 电子书秒变有声书的开源神器来了——Audiblez！把 EPUB 直接扔进去，几分钟就吐出高质量 M4B 有声书！用的 Kokoro 语音模型，仅82M参数，但听感自然到离谱。亮点： T4 GPU跑《动物庄园》只要5分钟支持中英…

X AI KOLs Timeline 2026/05/16 10:31 工具

open-source epub-to-audiobook tts kokoro text-to-speech audiobook-generation python-tool

摘要

Audiblez 是一个开源工具，能将 EPUB 电子书快速转换为高质量的 M4B 有声书，使用 Kokoro-82M 语音模型，支持多种语言和图形界面，通过 pip 一键安装。

电子书秒变有声书的开源神器来了——Audiblez！把 EPUB 直接扔进去，几分钟就吐出高质量 M4B 有声书！用的 Kokoro 语音模型，仅82M参数，但听感自然到离谱。亮点： T4 GPU跑《动物庄园》只要5分钟支持中英法日等9国语言自带图形界面，傻瓜式操作 pip install audiblez 一行命令搞定真正读书党和懒人福音！再也不用花钱买有声书了 https://github.com/santinic/audiblez…

查看原文

查看缓存全文

缓存时间: 2026/05/16 17:22

电子书秒变有声书的开源神器来了——Audiblez！

把 EPUB 直接扔进去，几分钟就吐出高质量 M4B 有声书！用的 Kokoro 语音模型，仅82M参数，但听感自然到离谱。

亮点：
T4 GPU跑《动物庄园》只要5分钟
支持中英法日等9国语言
自带图形界面，傻瓜式操作
pip install audiblez 一行命令搞定

真正读书党和懒人福音！再也不用花钱买有声书了 https://github.com/santinic/audiblez…

santinic/audiblez

Source: https://github.com/santinic/audiblez

Audiblez: Generate audiobooks from e-books

PyPI - Python Version PyPI - Version

v4 Now with Graphical interface, CUDA support, and many languages!

Audiblez GUI on MacOSX

Audiblez generates .m4b audiobooks from regular .epub e-books, using Kokoro’s high-quality speech synthesis.

Kokoro-82M is a recently published text-to-speech model with just 82M params and very natural sounding output. It’s released under Apache licence and it was trained on < 100 hours of audio. It currently supports these languages: 🇺🇸 🇬🇧 🇪🇸 🇫🇷 🇮🇳 🇮🇹 🇯🇵 🇧🇷 🇨🇳

On a Google Colab’s T4 GPU via Cuda, it takes about 5 minutes to convert “Animal’s Farm” by Orwell (which is about 160,000 characters) to audiobook, at a rate of about 600 characters per second.

On my M2 MacBook Pro, on CPU, it takes about 1 hour, at a rate of about 60 characters per second.

How to install the Command Line tool

If you have Python 3 on your computer, you can install it with pip. You also need espeak-ng and ffmpeg installed on your machine:

sudo apt install ffmpeg espeak-ng                   # on Ubuntu/Debian 🐧
pip install audiblez

brew install ffmpeg espeak-ng                       # on Mac 🍏
pip install audiblez

Then you can convert an .epub directly with:

audiblez book.epub -v af_sky

It will first create a bunch of book_chapter_1.wav, book_chapter_2.wav, etc. files in the same directory, and at the end it will produce a book.m4b file with the whole book you can listen with VLC or any audiobook player. It will only produce the .m4b file if you have ffmpeg installed on your machine.

How to run the GUI

The GUI is a simple graphical interface to use audiblez. You need some extra dependencies to run the GUI:

sudo apt install ffmpeg espeak-ng 
sudo apt install libgtk-3-dev        # just for Ubuntu/Debian 🐧, Windows/Mac don't need this
  
pip install audiblez pillow wxpython

Then you can run the GUI with:

audiblez-ui

How to run on Windows

After many trials, on Windows we recommend to install audiblez in a Python venv:

Open a Windows terminal
Create anew folder: mkdir audiblez
Enter the folder: cd audiblez
Create a venv: python -m venv venv
Activate the venv: .\venv\Scripts\Activate.ps1
Install the dependencies: pip install audiblez pillow wxpython
Now you can run audiblez or audiblez-ui
For Cuda support, you need to install Pytorch accordingly: https://pytorch.org/get-started/locally/

Speed

By default the audio is generated using a normal speed, but you can make it up to twice slower or faster by specifying a speed argument between 0.5 to 2.0:

audiblez book.epub -v af_sky -s 1.5

Supported Voices

Use -v option to specify the voice to use. Available voices are listed here. The first letter is the language code and the second is the gender of the speaker e.g. im_nicola is an italian male voice.

For hearing samples of Kokoro-82M voices, go here

Language	Voices
🇺🇸 American English	`af_alloy`, `af_aoede`, `af_bella`, `af_heart`, `af_jessica`, `af_kore`, `af_nicole`, `af_nova`, `af_river`, `af_sarah`, `af_sky`, `am_adam`, `am_echo`, `am_eric`, `am_fenrir`, `am_liam`, `am_michael`, `am_onyx`, `am_puck`, `am_santa`
🇬🇧 British English	`bf_alice`, `bf_emma`, `bf_isabella`, `bf_lily`, `bm_daniel`, `bm_fable`, `bm_george`, `bm_lewis`
🇪🇸 Spanish	`ef_dora`, `em_alex`, `em_santa`
🇫🇷 French	`ff_siwis`
🇮🇳 Hindi	`hf_alpha`, `hf_beta`, `hm_omega`, `hm_psi`
🇮🇹 Italian	`if_sara`, `im_nicola`
🇯🇵 Japanese	`jf_alpha`, `jf_gongitsune`, `jf_nezumi`, `jf_tebukuro`, `jm_kumo`
🇧🇷 Brazilian Portuguese	`pf_dora`, `pm_alex`, `pm_santa`
🇨🇳 Mandarin Chinese	`zf_xiaobei`, `zf_xiaoni`, `zf_xiaoxiao`, `zf_xiaoyi`, `zm_yunjian`, `zm_yunxi`, `zm_yunxia`, `zm_yunyang`

For more detaila about voice quality, check this document: Kokoro-82M voices

How to run on GPU

By default, audiblez runs on CPU. If you pass the option --cuda it will try to use the Cuda device via Torch.

Check out this example: Audiblez running on a Google Colab Notebook with Cuda .

We don’t currently support Apple Silicon, as there is not yet a Kokoro implementation in MLX. As soon as it will be available, we will support it.

Manually pick chapters to convert

Sometimes you want to manually select which chapters/sections in the e-book to read out loud. To do so, you can use --pick to interactively choose the chapters to convert (without running the GUI).

Help page

For all the options available, you can check the help page audiblez --help:

usage: audiblez [-h] [-v VOICE] [-p] [-s SPEED] [-c] [-o FOLDER] epub_file_path

positional arguments:
  epub_file_path        Path to the epub file

options:
  -h, --help            show this help message and exit
  -v VOICE, --voice VOICE
                        Choose narrating voice: a, b, e, f, h, i, j, p, z
  -p, --pick            Interactively select which chapters to read in the audiobook
  -s SPEED, --speed SPEED
                        Set speed from 0.5 to 2.0
  -c, --cuda            Use GPU via Cuda in Torch if available
  -o FOLDER, --output FOLDER
                        Output folder for the audiobook and temporary files

example:
  audiblez book.epub -l en-us -v af_sky

to use the GUI, run:
  audiblez-ui

Author

by Claudio Santini in 2025, distributed under MIT licence.

相似文章

@wsl8297: 想把电子书或文档做成有声书？很多工具不是声音太“机器人”，就是不支持字幕同步，折腾一圈还是不满意。我后来发现了开源项目 Abogen：支持 ePub、PDF、纯文本等，一键转高质量音频，还能自动生成同步字幕。它底层用 Kokoro 语…

X AI KOLs Timeline

Abogen 是一个开源工具，可将 ePub、PDF 等文档一键转为高质量音频，并自动生成同步字幕，支持语音混合器和多种部署方式。

@FakeMaidenMaker: 炸裂！这个开源项目免费文字转无 AI 味人声，还能克隆任何人的嗓音，并且用文字调整音色！ GitHub 狂揽 30K star，出自面壁智能 OpenBMB，VoxCPM 之前拿过 GitHub 和 HuggingFace 双榜第一。做…

X AI KOLs Timeline

VoxCPM2是OpenBMB开源的语音合成模型，采用无分词器的扩散自回归架构，支持30种语言、语音设计和可控语音克隆，仅需一句话即可克隆音色，或用文字创建全新声音，输出48kHz高质量音频，可商用。

@Jolyne_AI: 又在 GitHub 挖到一款颜值与实力兼具的电子书阅读器：Foliate，让阅读更顺手、更专注、更高效。它不只界面清爽耐看，还把查词、注释、朗读等常用能力做进同一个阅读场景里，读书不必来回切工具。 GitHub：http://githu…

X AI KOLs Timeline

Foliate 是一款开源的电子书阅读器，界面清爽，集成查词、注释、朗读等功能，支持 EPUB、PDF、Kindle 等格式，可在 Linux 上通过 Flatpak 或 Snap 安装。

@noahduck283: 可以下载任何 YouTube 视频、干净地去除人声、进行转录、翻译成 100 多种语言、克隆原声并完成全自动配音的工具。全程不到 2 分钟。100% 本地运行。免费把六个顶级开源模型缝进了一个网页"一键下载、去人声、转录、翻译、配音"的…

X AI KOLs Timeline

Voice-Pro 是一个整合了六个顶级开源模型（Whisper、Demucs、CosyVoice、F5-TTS 等）的网页工具，支持 YouTube 视频下载、去人声、转录、翻译、语音克隆和全自动配音，全程不到2分钟，100%本地运行且免费。

@LinearUncle: 推荐一家叫模思的中国公司的开源声音克隆仓库： MOSS-TTS 你朗读一段文字，它克隆你的声音，然后就可以用你的声音朗读任意文本，查看帖子详情看我实战如何使用，效果很好，可以以假乱真。 https://github.com/OpenMOS…

X AI KOLs Timeline

MOSS-TTS是模思公司推出的开源声音克隆模型，用户朗读少量文本即可克隆声音，随后可用克隆的声音生成任意语音，效果逼真。

santinic/audiblez

Audiblez: Generate audiobooks from e-books

v4 Now with Graphical interface, CUDA support, and many languages!

How to install the Command Line tool

How to run the GUI

How to run on Windows

Speed

Supported Voices

How to run on GPU

Manually pick chapters to convert

Help page

Author

相似文章

@FakeMaidenMaker: 炸裂！这个开源项目免费文字转无 AI 味人声，还能克隆任何人的嗓音，并且用文字调整音色！ GitHub 狂揽 30K star，出自面壁智能 OpenBMB，VoxCPM 之前拿过 GitHub 和 HuggingFace 双榜第一。 做…

提交意见反馈

@FakeMaidenMaker: 炸裂！这个开源项目免费文字转无 AI 味人声，还能克隆任何人的嗓音，并且用文字调整音色！ GitHub 狂揽 30K star，出自面壁智能 OpenBMB，VoxCPM 之前拿过 GitHub 和 HuggingFace 双榜第一。做…