mini-omni.zip
Size: 4.38MB
Uploader: fenglingguitar
Updated: 2025-09-22

Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming

Resource file list (approximate)

| File name | Size |
| --- | --- |
| mini-omni/ | - |
| mini-omni/tokenizer_config.json | 1.26KB |
| mini-omni/lit_model.pth | 135B |
| mini-omni/tokenizer.json | 6.7MB |
| mini-omni/README.md | 1.09KB |
| mini-omni/.gitattributes | 1.48KB |
| mini-omni/model_config.yaml | 848B |
| mini-omni/.git/ | - |
| mini-omni/frameworkv3.jpg | 167.82KB |
| mini-omni/.git/config | 307B |
| mini-omni/.git/objects/ | - |
| mini-omni/.git/HEAD | 21B |
| mini-omni/.git/info/ | - |
| mini-omni/.git/logs/ | - |
| mini-omni/.git/description | 73B |
| mini-omni/.git/hooks/ | - |
| mini-omni/.git/refs/ | - |
| mini-omni/.git/index | 625B |
| mini-omni/.git/packed-refs | 112B |
| mini-omni/.git/objects/95/ | - |
| mini-omni/.git/objects/68/ | - |
| mini-omni/.git/objects/3b/ | - |
| mini-omni/.git/objects/33/ | - |
| mini-omni/.git/objects/b3/ | - |
| mini-omni/.git/objects/da/ | - |
| mini-omni/.git/objects/f4/ | - |
| mini-omni/.git/objects/eb/ | - |
| mini-omni/.git/objects/pack/ | - |
| mini-omni/.git/objects/7b/ | - |
| mini-omni/.git/objects/2f/ | - |
| mini-omni/.git/objects/6e/ | - |
| mini-omni/.git/objects/info/ | - |
| mini-omni/.git/objects/98/ | - |
| mini-omni/.git/objects/a0/ | - |
| mini-omni/.git/objects/a7/ | - |
| mini-omni/.git/objects/a6/ | - |
| mini-omni/.git/objects/46/ | - |
| mini-omni/.git/info/exclude | 240B |
| mini-omni/.git/logs/HEAD | 189B |
| mini-omni/.git/logs/refs/ | - |
| mini-omni/.git/hooks/commit-msg.sample | 896B |
| mini-omni/.git/hooks/pre-rebase.sample | 4.78KB |
| mini-omni/.git/hooks/pre-commit.sample | 1.6KB |
| mini-omni/.git/hooks/applypatch-msg.sample | 478B |
| mini-omni/.git/hooks/fsmonitor-watchman.sample | 4.62KB |
| mini-omni/.git/hooks/pre-receive.sample | 544B |
| mini-omni/.git/hooks/prepare-commit-msg.sample | 1.46KB |
| mini-omni/.git/hooks/post-update.sample | 189B |
| mini-omni/.git/hooks/pre-merge-commit.sample | 416B |
| mini-omni/.git/hooks/pre-applypatch.sample | 424B |
| mini-omni/.git/hooks/pre-push.sample | 1.34KB |
| mini-omni/.git/hooks/update.sample | 3.56KB |
| mini-omni/.git/hooks/push-to-checkout.sample | 2.72KB |
| mini-omni/.git/refs/heads/ | - |
| mini-omni/.git/refs/tags/ | - |
| mini-omni/.git/refs/remotes/ | - |
| mini-omni/.git/objects/95/341e1d6ae4b5086e60f09e98f1a4ef42aca7fa | 91B |
| mini-omni/.git/objects/68/f1da89b775caff935efc24d5241bd10eeb677c | 126B |
| mini-omni/.git/objects/3b/7ba72c19d27aac21f7967459a282f92659d48b | 273B |
| mini-omni/.git/objects/33/ea6c72ebb92a237fa2bdf26c5ff16592efcdae | 2.2MB |
| mini-omni/.git/objects/b3/a695259f376d4eaf2e78d8d995ddf1fea57736 | 857B |
| mini-omni/.git/objects/da/6c66ccafbb524fe0d3f046053124239d67c75c | 475B |
| mini-omni/.git/objects/f4/b55f917af273d0dc98b67ec249f6445dd385f5 | 489B |
| mini-omni/.git/objects/eb/09342a1d90c60ff41325e0030971ff25e9fecd | 652B |
| mini-omni/.git/objects/7b/e5fc7f47d5db027d120b8024982df93db95b74 | 37B |
| mini-omni/.git/objects/2f/9d8d44c202347e18efe7ed842959c1f5b3b6f6 | 137.38KB |
| mini-omni/.git/objects/6e/a96fe583cf6200c1133368ad38011aac72eeeb | 810B |
| mini-omni/.git/objects/98/96323e09177b32edcceedd565a086a91d029ab | 852B |
| mini-omni/.git/objects/a0/dec2a89823c6136f481649b303d5905a31f866 | 234B |
| mini-omni/.git/objects/a7/22089c58869095607cb52d19b2f5a0c82cfe15 | 850B |
| mini-omni/.git/objects/a6/344aac8c09253b3b630fb776ae94478aa0275b | 224B |
| mini-omni/.git/objects/46/a665ebf65e1fb9b3ea646bfd6dc20618137d5c | 234B |
| mini-omni/.git/logs/refs/heads/ | - |
| mini-omni/.git/logs/refs/remotes/ | - |
| mini-omni/.git/refs/heads/main | 41B |
| mini-omni/.git/refs/remotes/origin/ | - |
| mini-omni/.git/logs/refs/heads/main | 189B |
| mini-omni/.git/logs/refs/remotes/origin/ | - |
| mini-omni/.git/refs/remotes/origin/HEAD | 30B |
| mini-omni/.git/logs/refs/remotes/origin/HEAD | 189B |

Resource description

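Mini-Omni is an open-source multimodal large language model that can hear and talk while thinking. It offers real-time, end-to-end speech input and streaming audio output for conversation. Features: real-time speech-to-speech conversation with no extra ASR or TTS models required; talking while thinking, generating text and audio at the same time; streaming audio output; and "Audio-to-Text" and "Audio-to-Audio" batch inference to further boost performance.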
---
license: mit
language:
- en
base_model: Qwen/Qwen2-0.5B
---

<p align="center"><strong style="font-size: 18px;">Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming</strong></p>

<p align="center">
🤗 <a href="">Hugging Face</a> | 📖 <a href="https://github.com/gpt-omni/mini-omni">Github</a> | 📑 <a href="https://arxiv.org/abs/2408.16725">Technical report</a>
</p>

Mini-Omni is an open-source multimodal large language model that can **hear, talk while thinking**. It features real-time end-to-end speech input and **streaming audio output** conversational capabilities.

<p align="center">
    <img src="frameworkv3.jpg" width="100%"/>
</p>

## Features

✅ **Real-time speech-to-speech** conversational capabilities. No extra ASR or TTS models required.

✅ **Talking while thinking**, with the ability to generate text and audio at the same time.

✅ **Streaming audio output** capabilities.

✅ With "Audio-to-Text" and "Audio-to-Audio" **batch inference** to further boost the performance.

**NOTE**: please refer to https://github.com/gpt-omni/mini-omni for more details.
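
The "talking while thinking" and streaming-output features can be pictured with a toy example. The sketch below is not Mini-Omni's implementation. It only illustrates the general idea that a producer emits a text token and an audio chunk at each step, so a consumer can start playing audio before the full text exists. `generate_steps` and `consume` are hypothetical names, and the token and chunk values are made up.

```python
from itertools import zip_longest
from typing import Iterable, Iterator, List, Optional, Sequence, Tuple

# One generation step: a text token and an audio chunk, either of which may be
# absent once the shorter stream has finished.
Step = Tuple[Optional[str], Optional[bytes]]


def generate_steps(text_tokens: Sequence[str],
                   audio_chunks: Sequence[bytes]) -> Iterator[Step]:
    """Hypothetical stand-in for a model: emit text and audio side by side."""
    yield from zip_longest(text_tokens, audio_chunks)


def consume(steps: Iterable[Step]) -> Tuple[str, List[bytes]]:
    """Handle each step as it arrives instead of waiting for the full reply."""
    text_parts: List[str] = []
    played: List[bytes] = []
    for token, chunk in steps:
        if token is not None:
            text_parts.append(token)
        if chunk is not None:
            # A real client would hand the chunk to an audio device here.
            played.append(chunk)
    return "".join(text_parts), played


if __name__ == "__main__":
    steps = generate_steps(["Hi", " there"], [b"\x00\x01", b"\x02\x03", b"\x04"])
    text, played = consume(steps)
    print(text, len(played))
```

Because `generate_steps` is a generator, the first audio chunk is available after a single step rather than after the whole response, which is the property that streaming output relies on.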
