-
Notifications
You must be signed in to change notification settings - Fork 3.2k
feat(eot): add audio models AGT-2520 #4722
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 107 commits
Commits
Show all changes
120 commits
Select commit
Hold shift + click to select a range
87068d5
add interface draft
chenghao-mou e0d5ec1
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou 8eebccc
draft
chenghao-mou f92fbc0
fix type issues
chenghao-mou d1086ff
refactor stream to support turn detector protocol
chenghao-mou 0a02bb1
minor fixes
chenghao-mou 168d0d7
minor fixes
chenghao-mou 277db6e
WIP: use only ws stream
chenghao-mou 03c0e2e
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou 56b4796
fix uv.lock bad merge
chenghao-mou be9a550
WIP: more refactoring
chenghao-mou 601229c
fix mypy
chenghao-mou c4d92f8
remove temp url
chenghao-mou e963d85
disable turn detection when agent is still speaking
chenghao-mou c529d79
minor refactoring
chenghao-mou 09baed8
fix type issues
chenghao-mou 3830638
wip
chenghao-mou f214aa0
clean up encoder
chenghao-mou c922f44
wip
chenghao-mou f94a0dd
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou 604bfdc
update protos
chenghao-mou f9ec64a
minor fixes
chenghao-mou ddbf594
address comments
chenghao-mou d465564
add text fallback
chenghao-mou 6e7d6bf
add text fallback
chenghao-mou 200d634
fix threshold
chenghao-mou dbd11b0
remove temp deps
chenghao-mou 60004dd
support realtime model
chenghao-mou 6de53f4
fix type issues
chenghao-mou 4ed8a82
add id in logs
chenghao-mou 0db57ea
use threaded audio encoder
chenghao-mou bbcfc3a
close encoder
chenghao-mou 7e04332
update dep
chenghao-mou 04db92f
address comments
chenghao-mou 46fd3bf
add cloud agent worker token
chenghao-mou e4e8ef6
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou fc94068
fix type issues
chenghao-mou 999edd5
add token in header instead
chenghao-mou cde90de
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou 3603f04
wip
chenghao-mou 6272402
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou 3bc3ff3
refactor for the cloud model
chenghao-mou a08b624
add support for both v1 and v1-mini
chenghao-mou f435571
fix example
chenghao-mou 8e75d60
address comments
chenghao-mou cf54cbe
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou 4f10a69
address comments
chenghao-mou e96f1be
clean up session _on_error annotation
chenghao-mou 97400d2
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou b1e9294
merge inference and local eot code
chenghao-mou 49f0de0
update tests
chenghao-mou 7fe2bfb
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou 8b150aa
clean up
chenghao-mou 28af3f5
minor refactor and clean up
chenghao-mou 75ddae6
refactor
chenghao-mou 76cec5d
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou 2ccf54d
refactor
chenghao-mou 7fbca08
clean up
chenghao-mou 82c599a
refactor
chenghao-mou 4b6fdb5
clean up
chenghao-mou 7500160
more refactoring
chenghao-mou efe8d5c
fix makefile indentation
chenghao-mou 3237f9d
update protocol
chenghao-mou 09cdb0c
add direct commit for late stt transcripts
chenghao-mou f02d24e
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou 0a6a66d
add local inference dependency
chenghao-mou 80fbc29
update deps
chenghao-mou 21310ee
use local inference pacakge and deprecate the turn detector package
chenghao-mou 5408ae1
feat(vad): bundle optimized silero vad and deprecate the plugin (#5800)
chenghao-mou 442d857
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou 9d9cb52
fix type issue
chenghao-mou 2b4cc7e
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou a42cf7a
refactoring
chenghao-mou f592a16
drop duplicate calls and simplify triggers
chenghao-mou 82ad113
fix vad restore bug
chenghao-mou f93f7ca
adjust thresholds for the cloud model
chenghao-mou 7ff1eb1
update warning message
chenghao-mou e9b8a1e
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou cc6b6a9
avoid duplicate eot event fire
chenghao-mou 1447d1d
fix env var
chenghao-mou cbf3a3a
fix env and example
chenghao-mou f4a91a1
clean up examples and readme
chenghao-mou 09ae669
more clean up and refactoring
chenghao-mou f8c9672
adjust default vad value to match the eot requirement
chenghao-mou 5e47173
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou 63521b2
fix user speaking state bug
chenghao-mou 6c713de
livekit-agents@1.6.0.rc1 (#5866)
github-actions[bot] 21c9604
more default refactoring
chenghao-mou 719b76e
more refactoring and clean up
chenghao-mou bde8445
more refactoring and clean up
chenghao-mou 7d71be9
address comment
chenghao-mou a4d1414
Merge remote-tracking branch 'origin/main' into feat/AGT-2520-multimo…
chenghao-mou 545f115
rename backend to model
chenghao-mou 080216a
livekit-agents@1.6.0.rc2 (#5891)
github-actions[bot] 1b48088
improve default parsing
chenghao-mou 66d94c2
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou c512a5d
clean up error logs
chenghao-mou d417609
reformat
chenghao-mou 3bc119c
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou e4f7fad
fix tests
chenghao-mou 28e54ee
refactor(eot): rename AudioTurnDetector to TurnDetector, model arg to…
chenghao-mou b4f5d41
fix(tests): init _user_silence_ev in EOU recognition test helper
chenghao-mou c2ee019
Merge remote-tracking branch 'origin/main' into feat/AGT-2520-multimo…
chenghao-mou 1a3a270
skip None threshold or probability events
chenghao-mou b99d4d1
docs: add LiveKit Model License and reference it in README
chenghao-mou 6cbb55e
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou f8a6ce7
Update livekit-agents/livekit/agents/inference/_utils.py
chenghao-mou d74df8d
drop FSM in stream and move logic to audio recognition
chenghao-mou 66f0e1c
remove TurnDetector reference
chenghao-mou 157e7b4
restore deps due to bad merge
chenghao-mou b73663d
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou 62cd057
fix(voice): route and drain AsyncToolset executors correctly on hando…
toubatbrian 1e1332e
fix(deepgram): use stored language when validating model in update_op…
adityajha2005 a528fc9
simulation: read the dispatch from the simulator participant metadata…
theomonnom c8f0e26
remove the next-release changeset machinery (#6054)
theomonnom 13af956
fix(bargein): error when no interruption threshold is known (#6034)
chenghao-mou 671ef28
Merge branch 'main' into feat/AGT-2520-multimodal-EOU
chenghao-mou c72938c
adjust default endpointing delays for the audio model
chenghao-mou 2328f3a
address comment: keep EP mode&alpha overrides
chenghao-mou a1972b0
feat(eot): emit agent backchannel opportunity events (#6099)
chenghao-mou File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,113 @@ | ||
| LIVEKIT MODEL LICENSE AGREEMENT | ||
|
|
||
| 1. Introduction | ||
|
|
||
| LiveKit Incorporated ("LiveKit") is making available its proprietary models for | ||
| use pursuant to the terms and conditions of this Agreement. As further | ||
| described below, you may use these LiveKit models freely but can only use them | ||
| together with the LiveKit Agents framework. You cannot use the LiveKit models | ||
| on a standalone basis or with any other frameworks. | ||
|
|
||
| BY CLICKING "I ACCEPT," OR BY DOWNLOADING, INSTALLING, OR OTHERWISE ACCESSING | ||
| OR USING THE LIVEKIT MATERIALS, YOU AGREE THAT YOU HAVE READ AND UNDERSTOOD, | ||
| AND, AS A CONDITION TO YOUR USE OF THE LIVEKIT MATERIALS, YOU AGREE TO BE | ||
| BOUND BY, THE FOLLOWING TERMS AND CONDITIONS. | ||
|
|
||
| 2. Definitions | ||
|
|
||
| "Agreement" means this LiveKit Model License Agreement. | ||
|
|
||
| "Documentation" means the specifications, manuals, and documentation | ||
| accompanying any LiveKit Model and distributed by LiveKit. | ||
|
|
||
| "Licensee" or "you" means the individual or entity agreeing to be bound by | ||
| this Agreement. | ||
|
|
||
| "LiveKit Agents" means the proprietary LiveKit software framework for building | ||
| real-time multimodal AI applications with programmable backend participants. | ||
|
|
||
| "LiveKit Materials" means, collectively, the LiveKit Models and Documentation. | ||
|
|
||
| "LiveKit Model" means any of LiveKit's proprietary software models or | ||
| algorithms, including machine-learning software code, model weights, | ||
| inference-enabling software code, training-enabling software code, and | ||
| fine-tuning enabling software code. Any derivative works of a LiveKit Model, | ||
| whether developed by LiveKit, you, or any third party, will be deemed the | ||
| "LiveKit Model" for the purposes of this Agreement. | ||
|
|
||
| 3. License Rights | ||
|
|
||
| Right to Use LiveKit Materials. Subject to the terms and conditions of this | ||
| Agreement, including the requirements of Section 3.b, LiveKit grants you a | ||
| nonexclusive, nontransferable, worldwide, royalty-free license under LiveKit's | ||
| intellectual property rights to use, reproduce, distribute, copy, and create | ||
| derivative works of the LiveKit Materials. | ||
|
|
||
| Limitation on Use. As a condition to your use of the LiveKit Materials, you | ||
| agree: (i) not to use any LiveKit Models on a standalone basis or with any | ||
| frameworks other than LiveKit Agents; (ii) not to use any LiveKit Materials or | ||
| any output from, or results of using, LiveKit Models (including any derivative | ||
| works thereof) to improve or otherwise develop any other models that are not | ||
| LiveKit Models; or (iii) distribute or otherwise make available the LiveKit | ||
| Materials (including any derivative works thereof) except (x) pursuant to the | ||
| terms of this Agreement, and (y) you reproduce the above copyright notice. | ||
|
|
||
| 4. Intellectual Property | ||
|
|
||
| The LiveKit Materials are owned by LiveKit and its licensors. Except for the | ||
| rights granted to you under this Agreement, all rights are reserved and no | ||
| other express or implied rights are granted. | ||
|
|
||
| You will own any derivative works that you created from the LiveKit Materials, | ||
| subject to the terms of this Agreement. | ||
|
|
||
| 5. Disclaimer | ||
|
|
||
| UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING, LIVEKIT PROVIDES | ||
| THE LIVEKIT MATERIALS, AND ANY OUTPUT OR RESULTS THEREFROM, ON AN "AS IS" | ||
| BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, EITHER EXPRESS OR IMPLIED, | ||
| INCLUDING, WITHOUT LIMITATION, ANY WARRANTIES OR CONDITIONS OF TITLE, | ||
| NON-INFRINGEMENT, MERCHANTABILITY, OR FITNESS FOR A PARTICULAR PURPOSE. YOU | ||
| ARE SOLELY RESPONSIBLE FOR DETERMINING THE APPROPRIATENESS OF USING OR | ||
| REDISTRIBUTING THE LIVEKIT MATERIALS AND ASSUME ANY RISKS ASSOCIATED WITH YOUR | ||
| USE OF THE LIVEKIT MATERIALS AND ANY OUTPUT AND RESULTS. | ||
|
|
||
| 6. Limitation of Liability | ||
|
|
||
| IN NO EVENT AND UNDER NO LEGAL THEORY, WHETHER IN TORT (INCLUDING NEGLIGENCE), | ||
| CONTRACT, OR OTHERWISE, UNLESS REQUIRED BY APPLICABLE LAW (SUCH AS DELIBERATE | ||
| AND GROSSLY NEGLIGENT ACTS) OR AGREED TO IN WRITING, WILL LIVEKIT BE LIABLE TO | ||
| YOU FOR INDIRECT DAMAGES, INCLUDING ANY SPECIAL, INCIDENTAL, OR CONSEQUENTIAL | ||
| DAMAGES OF ANY CHARACTER ARISING AS A RESULT OF THIS AGREEMENT OR OUT OF THE | ||
| USE OR INABILITY TO USE THE LIVEKIT MATERIALS OR ANY OUTPUT OR RESULTS | ||
| THEREFROM (INCLUDING BUT NOT LIMITED TO DAMAGES FOR LOSS OF GOODWILL, WORK | ||
| STOPPAGE, COMPUTER FAILURE OR MALFUNCTION, OR ANY AND ALL OTHER COMMERCIAL | ||
| DAMAGES OR LOSSES), EVEN IF LIVEKIT HAS BEEN ADVISED OF THE POSSIBILITY OF | ||
| SUCH DAMAGES. | ||
|
|
||
| 7. Trademarks | ||
|
|
||
| This Agreement does not grant permission to use the trade names, trademarks, | ||
| service marks, or product names of LiveKit, except as required for reasonable | ||
| and customary use in describing the origin of the LiveKit Materials. | ||
|
|
||
| 8. Term and Termination | ||
|
|
||
| The term of this Agreement commences upon your acceptance of this Agreement | ||
| and continues in effect until you cease using the LiveKit Materials or it is | ||
| terminated by either party (on immediate written notice to the other party). | ||
| This Agreement will automatically terminate if you breach any of its terms. | ||
| Upon termination, you must immediately cease all use of the LiveKit Materials. | ||
| Sections 4, 5, 6, and 9 will survive termination. | ||
|
|
||
| 9. Governing Law and Venue | ||
|
|
||
| This Agreement is subject to the laws of the State of California, without | ||
| regard to its conflict of laws principles. The UN Convention on Contracts for | ||
| the International Sale of Goods does not apply to this Agreement. The courts | ||
| located in San Francisco, California, have exclusive jurisdiction for any | ||
| dispute arising out of this Agreement. | ||
|
|
||
| + + + + | ||
|
|
||
| Last Updated: November 25, 2024 |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.