SamuKata
yum
yum

gumroad


TaSTT (modular avatar)

.unitypackage with modular avatar prefab

This is an OSC speech-to-text chatbox that you can attach to your avatar with modular avatar.

You will need the corresponding free OSC app (download). This app transcribes your voice locally using an optimized version of OpenAI's Whisper algorithm, then sends the transcript into VRChat with OSC.

Features:

Requirements:

Avatar performance:

Strawman FAQ:

Legal:

tastt_modular_v01.unitypackage

Old versions

Here is a 51-second overview of the setup process in Unity:

TaSTT (modular avatar) setup overview

The process is described in greater detail below.

To get a test avatar:

  • Enable Modular Avatar using the VRC Creator Companion.

  • Import the TaSTT .unitypackage

  • Drag yum_food/TaSTT_Modular/TaSTT.prefab onto your avatar hierarchy. DO NOT put it in your armature! Place it next to your meshes as shown in the video.

  • Set the parent constraint on TaSTT/Container to either your dominant hand (I use my right hand) or your head.

  • Fine tune the position of TaSTT/Container/TaSTT by dragging and rotating it.

  • Build and test.

  • Download the latest version of the OSC app (direct link). Extract the .zip and double click TaSTT.exe. You might get a security warning. Just hit more options then run anyway.

    • Why does the security warning come up? Because my code is not signed. Getting a code signing cert is expensive. If you're uneasy, run your antivirus of choice before executing.

  • Hit list input devices. Identify the number associated with your microphone, then select it in the Microphone dropdown.

    • The first time you press this button, the app will acquire all its runtime Python dependencies. Please be patient, it could take a while. It will only have to do this once.

    • If you're using an index or a bigscreen beyond, you can just select index or beyond.

  • Set Characters per sync to 10.

  • Set Text box rows to 4.

  • Set Text box columns to 40.

  • Deselect Use built-in chatbox.

  • If you're on a 1000 series NVIDIA GPU, set GPU compute type to float32.

  • Hit Begin transcribing. You should see your speech being transcribed in the OSC app.

  • Press ctrl+alt+x to make the chatbox appear in game. You should see your speech show up on the box.

This is how my OSC app is configured to use the custom chatbox on a 3000 series GPU:

To upload your avatar:

  • In the OSC app, go to Debug then hit Clear OSC configs.

    • Why is this necessary? Your avatar has a bunch of parameters. OSC needs to know what parameters your avatar has. VRChat generates a file telling OSC what parameters your avatar exposes. For some reason, VRChat generates the config once then never updates it, even if you change your parameters. Deleting OSC configs forces VRChat to regenerate the config when you upload.

    • What does this do? It deletes AppData/LocalLow/VRChat/vrchat/OSC.

    • Is it safe? Yes. This does not affect any already-uploaded avatars. It'll just force the VRC SDK to generate new OSC configs whenever you upload.

  • Upload.

If you need further assistance, please check my discord server. There's some common troubleshooting info #debug-shortlist. Further help requests should be posted in #help.

TaSTT (modular avatar)

More Creators