Final Project¶
Requirements¶
Project Overview¶
Project Name: Capybara
Objective: Talking Capybara
Target User: for people who need an extra companion
Functional Requirements (mvp)¶
- Core Function: talk
- Input: mic
- Processing: speech to text - large language model - text to speech pipeline on external server
- Output: sound
Functional Requirements (v2)¶
- Core Function: see and talk
- Input: mic, camera
- Processing: speech to text - large multimodal language model - text to speech pipeline on external server
- Output: sound
Functional Requirements (v3)¶
- Core Function: see, talk and move
- Input: mic, camera
- Processing: speech to text - large multimodal language model with tool calls - text to speech pipeline on external server
- Output: sound, motor movements
Sketch¶
I used chatgpt to generate some sketches
v1
prompt: generate the shape of a capybara that is easy to design in cad and add color
v2
prompt: generate an image of how it would look if it were 3d printed and had components like a mic, speaker, battery, camera for a talking llm toy
v3
prompt: generate an image with motors