8000 Add sentence retreival and storage to transcription webhook. by boazsender · Pull Request #1831 · codeforboston/maple · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Add sentence retreival and storage to transcription webhook. #1831

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

boazsender
Copy link
Contributor

Adds sentence retrieval from Assembly as a backup for when speaker diarisation doesn't complete (more than 10 speakers) and no utterances are returned.

Copy link
vercel bot commented May 29, 2025

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated (UTC)
maple-dev ✅ Ready (Inspect) Visit Preview 💬 Add feedback Jun 12, 2025 3:55pm

Copy link
Collaborator
@Mephistic Mephistic left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, just the note about which AssemblyAI extract method to use. 👍 to doing this for all hearings to start, even those with acceptable diarization - we can always disable this conditionally if we find we're not getting enough value.

@@ -67,6 +68,10 @@ export const transcription = functions
if (authenticatedEventIds.length === 1) {
// If there is one authenticated event, pull out the parts we want to
// save and try to save them in the db.

const { sentences } = await assembly.transcripts.sentences(
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just checked with Matt V - we'd prefer to use the extract paragraphs API rather than the extract sentences API - sentences can get pretty fragmented in some cases, especially around bill names.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants
0