"Avatar" is being stretched to cover everything from learned-data signing generation to 3D models to video-to-video skin overlays on a human signer. These are fundamentally different technologies. Who gets to define the terms — and should conferences require presenters to label their tech honestly?
In the sign language AI space, "avatar" now covers at least three very different things: true signing generation from learned data, 3D rigged models, and video-to-video tools that take a real human performance and swap the skin or face. Lumping them under one word makes it impossible for the community, researchers, procurement officers, or even funders to compare what's actually being built. Without shared definitions — and without venues enforcing them — misleading labels become the norm and trust erodes. What should the taxonomy look like, and whose job is it to hold the line?
Prompts to get you thinking
- What are the distinct technologies being called "avatars" right now, and what should each actually be called?
- Should conferences, publications, or funders require technical disclosure of how a signing "avatar" is generated?
- Have you seen a demo where the label didn't match the technology? What was missed?
Join free to reply
Replies
No replies yet — be the first.
This thread just opened. The first voices here shape how the rest of the conversation unfolds. Post your perspective above.