Closed Bug 1032964 Opened 11 years ago Closed 7 years ago

[B2G][SpeechRTC][User Story]: Enabling Voice input in Firefox OS

Tracking

(Not tracked)

Status:

RESOLVED WONTFIX

People

(Reporter: skamat, Assigned: anatal)

References

Details

(Keywords: feature, Whiteboard: [webspeechapi])

User Story

As a user, I would like to have support for voice input on my FxOS device for executing various actions on the device, as well as fetching info from the cloud.

Attachments

(1 obsolete file)

Part 1 of 8: Introduces the B2G specific build flags, initially enabled. (10 years ago, kdavis, 2.66 KB, patch)

Sandip Kamat

Reporter

Description

•

11 years ago

As a user, I would like to have support for voice input on my FxOS device for executing various actions on the device, as well as fetching info from the cloud.

Current thought is to break it down into 2 phases:
phase 1: Grammar based, local actions
phase 2: Natural language, server assisted actions

This will be used as a user story meta bug to track dependent work items.
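To make "grammar based, local actions" in phase 1 concrete, here is a minimal sketch of the idea: a fixed set of phrases is compiled into a grammar, and each recognized phrase maps to a local action. It assumes JSGF as the grammar format. The command phrases, action names, and the helpers `build_jsgf` and `dispatch` are hypothetical and only for illustration; this is not the implementation tracked by this bug.

```python
# Hypothetical command table for illustration only; these phrases and
# action names do not come from the bug.
COMMANDS = {
    "call mom": "dial_contact",
    "open camera": "launch_camera",
    "play music": "start_music",
}


def build_jsgf(name, phrases):
    """Return a JSGF grammar whose single public rule accepts any listed phrase."""
    alternatives = " | ".join(sorted(phrases))
    return f"#JSGF V1.0;\ngrammar {name};\npublic <command> = {alternatives};\n"


def dispatch(transcript, commands=COMMANDS):
    """Map a recognized transcript to a local action name, or None if unknown."""
    # Normalize case and whitespace before looking the phrase up.
    normalized = " ".join(transcript.lower().split())
    return commands.get(normalized)


if __name__ == "__main__":
    print(build_jsgf("commands", COMMANDS))
    print(dispatch("Open  Camera"))
```

Because the grammar is a closed list, the recognizer only has to choose among known phrases, which is what makes this phase feasible locally on the device without server assistance.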

Sandip Kamat

Reporter

Updated

•

11 years ago

User Story: (updated)

Andre Natal

Comment 1

•

11 years ago

Currently we have it running on Firefox. Steven, can we take the same approach on FxOS, linking against the pre-compiled lib as we did on Desktop, just to start testing?

Flags: needinfo?(slee)

StevenLee[:slee]

Comment 2

•

11 years ago

(In reply to anatal from comment #1)
> Currently we have it running on Firefox. Steven, Can we do same approach
> linking against the pre-compiled lib as we did on Desktop at FxOS only to
> start testing?

Sure, we can do that. :)

Flags: needinfo?(slee)

Andre Natal

Comment 3

•

11 years ago

Let's sync on a convenient time for us to work on it together.

Andre Natal

Comment 4

•

11 years ago

This is the current status of Web Speech API support in browsers: http://caniuse.com/web-speech

Sandip Kamat

Reporter

Comment 5

•

11 years ago

Update: A lot of progress has been made since we last met; here is the summary.

First, WebSpeechAPI integration is completed in a test build and we prepared a couple of demos. This implementation uses acoustic/language models and the decoder from PocketSphinx (all open source):

Desktop demo (Firefox nightly on Mac): http://youtu.be/UcBvsU0fCPs
B2G (Flame) demo: https://www.youtube.com/watch?v=0zqBbDmQlQ4

Completed Items
1. Coding the integration of the pocketsphinx API with the Web Speech API layer in Gecko
2. Modify the gUM C++ layer to return PCM at 8 kHz
3. Test the API with the speech decoder
3.1 Adjust pocketsphinx parameters to enhance accuracy
3.2 Define which languages we'll support initially -> Focused on English at this time
4. Include pocketsphinx sources in Gecko and write the moz.build files for each library so they are multi-platform and compiled with ./mach
5. Integrate gecko-dev with b2g and compile them together to support FxOS (OK)
6. Test build images ready for Mac and Flame (b2g) -- (Please send a note to ANatal@gmail.com)

Next Steps
1. Make minor adjustments to the API implementation, code reviews
2. Write mochitests (discussing with QA/Jonathan)
3. Write the prototype (grammar based) app integrated with Gaia (codenamed "Vaani")
4. Create remaining desktop images for Windows (and Android)
5. Plan integration into the baselines (Gecko, b2g)

Thanks,
Sandip Kamat & Andre Natal

From: "Sandip Kamat" <skamat@mozilla.com>
To: dev-b2g@lists.mozilla.org, dev-webapps@lists.mozilla.org, "mozilla-dev-gaia@lists.mozilla.org" <dev-gaia@lists.mozilla.org>
Cc: "André Natal" <anatal@gmail.com>, "Dietrich Ayala" <dietrich@mozilla.com>, "Josh Carpenter" <jcarpenter@mozilla.com>, "Larissa Shapiro" <lshapiro@mozilla.com>
Sent: Tuesday, July 1, 2014 12:46:26 PM
Subject: Enabling Voice Input in Open Web / Firefox OS "Many Voices, One Mozilla"

Hi All,

Here is the summary of the high level draft plans we are beginning with for enabling Voice Input in Open Web / Firefox OS.

One of our Firefox OS contributors, Andre Natal (Brazil community), has done lots of preparatory work and is currently continuing on a GSOC (Google Summer of Code) project around this. The proposed 2 phases of the plan are in the email below. Please note the releases and estimates marked below are *all tentative* (will change) and will be refined over the next several months.

We will continue adding updates here:
Wiki: https://wiki.mozilla.org/SpeechRTC_-_Speech_enabling_the_open_web
Bugzilla: Bug 1032964 - [B2G][SpeechRTC][User Story]: Enabling Voice input in Firefox OS
Trello board to track status: https://trello.com/b/UWXblmKb/webspeech-api
Github: https://github.com/andrenatal/gecko-dev

There is lots to do here and we are just starting, so if you are interested, please watch this wiki and help with the dependent bugs being added to the meta-bug above. This kind of project could use great community participation: contributing code, collecting / testing voice samples with various accents (remember "Many Voices, One Mozilla") to improve the acoustic / language models, creating fun gamifications to achieve that and, yes, we would need tons & tons of testing!
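For readers unfamiliar with item 2 of the completed items in the update above (returning PCM at 8 kHz), the sketch below shows the general idea of reducing a higher-rate PCM stream to 8 kHz. It is written in Python for illustration only and is not the Gecko gUM C++ code. The function name `downsample_to_8k` is hypothetical, and block averaging stands in for the proper low-pass filter a real resampler would use.

```python
def downsample_to_8k(samples, source_rate):
    """Decimate integer PCM samples to 8 kHz by averaging groups of samples.

    Assumes source_rate is an integer multiple of 8000 (e.g. 16000 or 48000).
    Averaging acts as a crude low-pass filter; any trailing partial group
    of samples is dropped.
    """
    target_rate = 8000
    if source_rate % target_rate != 0:
        raise ValueError("source_rate must be a multiple of 8000")
    factor = source_rate // target_rate
    out = []
    for start in range(0, len(samples) - factor + 1, factor):
        block = samples[start:start + factor]
        out.append(int(round(sum(block) / factor)))
    return out


if __name__ == "__main__":
    # Four samples at 16 kHz become two samples at 8 kHz.
    print(downsample_to_8k([10, 20, 30, 50], 16000))
```

A sample rate that is not a multiple of 8000, such as 44100 Hz, would need a fractional resampler instead, which this sketch deliberately rejects.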

Sandip Kamat

Reporter

Updated

•

11 years ago

Depends on: 1049931

Sandip Kamat

Reporter

Comment 6

•

11 years ago

Phase 1 is tracked with a separate bug now in 1049931.

Sandip Kamat

Reporter

Updated

•

11 years ago

Depends on: 1049937

Sandip Kamat

Reporter

Comment 7

•

11 years ago

Phase 1 is tracked with a separate bug now in 1049937.

Sandip Kamat

Reporter

Comment 8

•

11 years ago

Phase 2 is tracked with a separate bug now in 1049937.

Andre Natal

Updated

•

11 years ago

Assignee: nobody → anatal

kdavis

Updated

•

10 years ago

Whiteboard: [webspeechapi]

kdavis

Updated

•

10 years ago

Assignee: anatal → kdavis

kdavis

Comment 9

•

10 years ago

Attached patch Part 1 of 8: Introduces the B2G specific build flags, initially enabled. (obsolete)

This is part 1 of 8 for this bug.

This patch introduces the B2G specific build flags (all enabled initially):

* MOZ_WEBSPEECH - Enables/Disables the STT API
* MOZ_WEBSPEECH_MODELS - Enables/Disables the model installations
* MOZ_WEBSPEECH_POCKETSPHINX - Compiles/Doesn't Compile Pocketsphinx, Sphinxbase, and relevant XPCOM models

The try run for this patch is here: https://treeherder.mozilla.org/#/jobs?repo=try&revision=1f8a8598fb48

Attachment #8604626 - Flags: review?(bugs)

André Natal

Assignee

Updated

•

10 years ago

Assignee: kdavis → anatal

André Natal

Assignee

Comment 10

•

10 years ago

(In reply to kdavis from comment #9)
> Created attachment 8604626 [details] [diff] [review]
> Part 1 of 8: Introduces the B2G specific build flags, initially enabled.
>
> This is the part 1 of 8 for this bug.
>
> This patch that introduces the B2G specific build flags (all enabled
> initially):
>
> * MOZ_WEBSPEECH - Enables/Disables the STT API
> * MOZ_WEBSPEECH_MODELS - Enables/Disables the model installations
> * MOZ_WEBSPEECH_POCKETSPHINX - Compiles/Doesn't Compile Pocketsphinx,
> Sphinxbase, and relevant XPCOM models
>
> The try for this patch is running here
> https://treeherder.mozilla.org/#/jobs?repo=try&revision=1f8a8598fb48

This is the metabug. This patch should be included in its appropriate bug inside the tree.

André Natal

Assignee

Updated

•

10 years ago

Attachment #8604626 - Flags: review?(bugs)

kdavis

Updated

•

10 years ago

Attachment #8604626 - Attachment is obsolete: true

André Natal

Assignee

Updated

•

7 years ago

Status: NEW → RESOLVED

Closed: 7 years ago

Resolution: --- → WONTFIX
