Job Description
Ready to Pioneer the Next Frontier in AI Communication?
Imagine a world where interacting with AI is as natural as chatting with a friend. Our client, a groundbreaking startup, is turning this vision into reality by developing advanced AI systems that blend the power of language models with seamless voice and video integration. You’ll be working on the latest approaches in Conversational AI, Speech AI and Multimodal models.
This isn't just another AI company – they're redefining the landscape of human-machine interaction. Despite recent acquisition offers, the team remains committed to their long-term vision, believing in the transformative potential of their technology.
As a key player in this venture, you'll:
Spearhead the creation of cutting-edge Conversational and multimodal AI systems
Collaborate with top-tier LLM providers to push technological boundaries
Assemble and lead a diverse team of AI visionaries
Innovate in model efficiency, accuracy, and performance
Craft advanced tools for AI evaluation and optimization
The company has already made waves with their initial prototype, garnering significant user engagement. Now, they're gearing up for the next phase – a revolutionary speech-to-speech AI system that promises to redefine conversational AI.
We're seeking candidates who are:
Skilled in Speech AI. Ideally TTS or Speech-to-Speech. Candidates with SOTA Speech processing may also be considered
Proven AI innovators with a history of impactful contributions
Experienced in nurturing and guiding high-performance teams
Well-versed in LLMs and generative AI technologies
Relevant publications in the space
Recent PhD with specific research in Speech-to-Speech or Speech/Audio generation are also encouraged to apply.
Work Environment: While the position offers some remote flexibility within the US, occasional travel to Seattle for collaborative sessions is preferred.
Compensation: The company offer competitive salaries reflective of your expertise. The wider package includes attractive stock options and performance bonuses.
Senior roles: $200,000 - $260,000+ base
PhD and mid-level positions: $160,000 - $200,000 base
If you're excited about shaping the future of AI interaction and want to be at the forefront of voice technology innovation, we want to hear from you. Join a team dedicated to breaking new ground in the exciting world of multimodal AI.
Ready to take the leap? Reach out today to start your journey towards redefining AI communication. We reply to everyone.
Questionnaire
Do you have experience in Speech Generation? Please select Yes No
Have you worked with Language models? Please select Yes No
Are you based in the US with the right to work in the US? Please select Yes No