Vocal Mind x Transcarent

An AI-guided voice and speech analysis tool that assists mental health practitioners in diagnosis during sessions.

At a glance

An accurate mental health diagnosis can take weeks up to years to determine. Vocal Mind provides a seamless solution that is incorporated into the clinical workflow. First, with the help of AI, voice patterns such as rhythm, emotion, brevity, and volume of speech are extracted. Then, doctors are provided with additional objective information about each session. The analysis is stored in a record database, allowing doctors and patients to track progress.

Selected as 1 of 9 teams to pitch to Cornell Tech faculty and partners

Team

1 Product designer (self)
2 Developers
1 Product manager
1 LLM

Timeline

4 months

Area

MVP validation

Discovery

How is tech currently integrated into healthcare processes?

Transcarent partnered with Cornell Tech to explore how tech could be leveraged in the healthcare industry. We researched the stakeholder ecosystem, current patient experience, and technological advancements to get a clear picture of the systems within the healthcare industry.

I led the user flow outline for a patient seeking treatment in today's healthcare industry. We noticed that there was an increase in technology integration from online consultations to prescription delivery after the pandemic.

We combined our industry and stakeholder research to visually represent the processes, players, and their positive or negative concerns.

User Interviews

Tech could be leveraged for early diagnostics, especially in mental health care.

I spearheaded the creation of our interview guides— one for medical providers and the other for people who have sought medical care. It was essential to understand their unique experiences and identify the interconnected issues.

“Providers are interested in generative AI, but the research is very new. AI is already used in diagnostics and pathology, classification tasks.”

“Providers are interested in generative AI, but the research is very new. AI is already used in diagnostics and pathology, classification tasks.”

Medical Student 
at Weill Cornell Medicine

Medical Student 
at Weill Cornell Medicine

5

Doctors

5

Patients

2

Med Students

User Insights

Doctors lack standardized mental health tools

Variability in mental health diagnostics criteria and limitations of subjective screening tools can lead to delayed or misdiagnosis.

Patients struggle to articulate their emotions

Patients have difficulty articulating their emotions, which slows or prevents timely diagnosis.

Market Research

$33.1bn

$33.1bn

Mental health market size in 2023, expected to grow 3% annually.

Mental health market size in 2023, expected to grow 3% annually.

IBIS World

IBIS World

6.1%

6.1%

CAGR for the evolution, diagnosis, and treatment of communication disorders

CAGR for the evolution, diagnosis, and treatment of communication disorders

Fortune Business Insight

Fortune Business Insight

90%

90%

Of hospitals are projected to use AI-driven tech for early diagnostics

Of hospitals are projected to use AI-driven tech for early diagnostics

CBI Insight

CBI Insight

design challenge
How might we…

Improve the mental health diagnosis process by utilizing an AI digital tool?

Risky Assumption Testing

We tested our product's technological feasibility and key desirability for MVP validation.

I collaborated with the engineers and product manager to identify and address the most uncertain and high-impact assumptions early in the process. We were able to prioritize critical features and reduce the risk of building something that doesn’t resonate with users or meet technical requirements, ultimately increasing the chances of the product’s success.

Assumption 1: The way people speak is affected by their mental state.

Ten people were enlisted to complete a week-long experiment in which they filled out a mood tracker and sent audio recordings about their day. Our questionnaire was modeled on the clinical guidelines for mental health diagnosis. We interpreted the recordings using an existing speech behavior analysis API by Humane AI.

Clinical guideline for mental healthcare diagnostics

Our questionnaire that we sent to participants

There was a significant and useful correlation found between the participants’ inputs from the mood tracker and API analysis.

Participant's mood tracker answers

Voice Sentiment Analysis AI API

We identified consistency in the AI’s insights throughout the experiment.

Day 1

“Confident and focused”

Day 2

“Tired, bit stressed, and focused”

Day 3

“Felt confident”

Assumption 2: People speak as they normally would when being recorded.

We compared the answers and mannerisms of interviewees while initially speaking unrecorded and after we began recording the conversation. The participants were randomly selected, unaware of the purpose of the research, and did not know ahead of time that they would be asked to be recorded halfway. 

80% had no significant voice change with or without recording.

We inferred that the changes in speech patterns were minimal and wouldn't drastically affect the accuracy of the analysis.

We inferred that the changes in speech patterns were minimal and wouldn't drastically affect the accuracy of the analysis.

75% took longer pauses as the conversation continued.

We took these pauses as an indicator that participants felt more comfortable the more they spoke with us.

We took these pauses as an indicator that participants felt more comfortable the more they spoke with us.

The Final Solution

Vocal Mind

An AI-guided voice and speech analysis tool that assists mental health practitioners in diagnosis during sessions.

User journey of a patient and practitioner leveraging Vocal Mind during their consultation session

Reflections

Collaboration is the key to success.

I thoroughly loved collaborating with my team as each of our unique perspectives was valued throughout the development process. We were able to thoughtfully consider the user's needs, the tech needed to make our solution feasible, and the roadmap for our product launch. This genuine involvement and curiosity led to a meaningful solution, and we were selected as 1 of 9 teams from a cohort of 80 groups to pitch to Cornell Tech faculty and partners.

© Kirsten Geiger 2024