[B2G][SpeechRTC][User Story]: Enabling Voice input in Firefox OS

RESOLVED WONTFIX

Status

defect
RESOLVED WONTFIX
5 years ago
2 years ago

People

(Reporter: skamat, Assigned: anatal)

Tracking

({feature})

Firefox Tracking Flags

(Not tracked)

Details

(Whiteboard: [webspeechapi])

User Story

As I user, I would like to have a support for voice input on my FxOS device for executing various actions on device, as well as fetching info from the cloud.

Attachments

(1 obsolete attachment)

(Reporter)

Description

5 years ago
As I user, I would like to have a support for voice input on my FxOS device for executing various actions on device, as well as fetching info from the cloud.

Current thought is to break it down into 2 phases
phase 1: Grammar based, local actions
phase 2: Natural lanaguage, server assisted actions

This will be used as a user story meta bug to track dependent work items.
(Reporter)

Updated

5 years ago
User Story: (updated)

Comment 1

5 years ago
Currently we have it running on Firefox. Steven, Can we do same approach linking against the pre-compiled lib as we did on Desktop at FxOS only to start testing?
Flags: needinfo?(slee)

Comment 2

5 years ago
(In reply to anatal from comment #1)
> Currently we have it running on Firefox. Steven, Can we do same approach
> linking against the pre-compiled lib as we did on Desktop at FxOS only to
> start testing?
Sure, we can do that. :)
Flags: needinfo?(slee)

Comment 3

5 years ago
Let's synch a more comfortable hour for us to work on it together.

Comment 4

5 years ago
This is the current status of Web Speech API support on browsers : http://caniuse.com/web-speech
(Reporter)

Comment 5

5 years ago
Update: 

A lot of progress was made since we met last and here is the summary. First, WebSpeechAPI integration is completed in a test build and we prepared a couple of demos. This implementation uses Acoustics/language models & decoder from PocketSphinx (all open source):


Desktop demo: (Firefox nightly on Mac): http://youtu.be/UcBvsU0fCPs

B2g (Flame) demo:  https://www.youtube.com/watch?v=0zqBbDmQlQ4

 

Completed Items
1. Coding the integration of pocketsphinx API with Web Speech API layer at Gecko
2. Modify gUM C++ layer to return pcm as 8khz 
3. Test the api with the speech decoder
 3.1 Adjust pocketsphinx parameters to enhance accuracy
 3.2 Define which languages we'll support initially -> Focused on English at this time
4. Include pocketsphinx sources on gecko and write the moz.build's for each library to be multi-platform and compiled with ./mach
5. Integrate the gecko-dev with b2g and compile them together to support FxOS (OK)

6. Test build Images ready for Mac and Flame (b2g) -- (Please send a note to ANatal@gmail.com)

 

Next Steps
1. Fix minor adjusts on API implementation, code reviews
2. Write mochitests (discussing with QA/Jonathan)
3. Write the prototype (grammar based) app integrated with Gaia. (codenamed "Vaani")

4. Create remaining desktop images for Windows (and Android)

5. Plan integration into the baselines (Gecko, b2g)

 

 

Thanks,

Sandip Kamat & Andre Natal

    From: "Sandip Kamat" <skamat@mozilla.com>
    To: dev-b2g@lists.mozilla.org, dev-webapps@lists.mozilla.org, "mozilla-dev-gaia@lists.mozilla.org" <dev-gaia@lists.mozilla.org>
    Cc: "André Natal" <anatal@gmail.com>, "Dietrich Ayala" <dietrich@mozilla.com>, "Josh Carpenter" <jcarpenter@mozilla.com>, "Larissa Shapiro" <lshapiro@mozilla.com>
    Sent: Tuesday, July 1, 2014 12:46:26 PM
    Subject: Enabling Voice Input in Open Web / Firefox OS

     

    "Many Voices, One Mozilla"

     

    Hi All,

     

    Here is the summary of high level draft plans we are beginning with for enabling Voice Input in Open Web / Firefox OS. One of our Firefox OS contributors Andre Natal (Brazil community) has done lots of preparatory work and is currently continuing on a GSOC (Google Summer of Code) project around this. The proposed 2 phases of the plan are in email below.

     

    Please note the releases and estimates marked below are *all tentative* (will change) and will be refined over next several months. We will continue adding updates here:

     

    Wiki

    https://wiki.mozilla.org/SpeechRTC_-_Speech_enabling_the_open_web

     

    Bugzilla

    Bug 1032964 - [B2G][SpeechRTC][User Story]: Enabling Voice input in Firefox OS

     

    Trello board to track status:

    https://trello.com/b/UWXblmKb/webspeech-api

     

    Github:

    https://github.com/andrenatal/gecko-dev

     

    There is lots to do here and we are just starting, so if you are interested, pls watch this wiki and help with the dependent bug# being added to the meta-bug above. This kind of project could use great community participation with contributing code, collecting / testing voice samples with various accents (Remember "many voices, One Mozilla") to improve the acoustic / language models, creating fun gamifications to achieve that and yes, we would need tons & tons of testing!
(Reporter)

Updated

5 years ago
Depends on: 1049931
(Reporter)

Comment 6

5 years ago
Phase 1 is tracked with a separate bug now in 1049931.
(Reporter)

Updated

5 years ago
Depends on: 1049937
(Reporter)

Comment 7

5 years ago
Phase 1 is tracked with a separate bug now in 1049937.
(Reporter)

Comment 8

5 years ago
Phase 2 is tracked with a separate bug now in 1049937.

Updated

5 years ago
Assignee: nobody → anatal

Updated

4 years ago
Whiteboard: [webspeechapi]

Updated

4 years ago
Assignee: anatal → kdavis

Comment 9

4 years ago
This is the part 1 of 8 for this bug. 

This patch that introduces the B2G specific build flags (all enabled initially):

* MOZ_WEBSPEECH - Enables/Disables the STT API
* MOZ_WEBSPEECH_MODELS - Enables/Disables the model installations
* MOZ_WEBSPEECH_POCKETSPHINX - Compiles/Doesn't Compile Pocketsphinx, Sphinxbase, and relevant XPCOM models   

The try for this patch is running here https://treeherder.mozilla.org/#/jobs?repo=try&revision=1f8a8598fb48
Attachment #8604626 - Flags: review?(bugs)
(Assignee)

Updated

4 years ago
Assignee: kdavis → anatal
(Assignee)

Comment 10

4 years ago
(In reply to kdavis from comment #9)
> Created attachment 8604626 [details] [diff] [review]
> Part 1 of 8: Introduces the B2G specific build flags, initially enabled.
> 
> This is the part 1 of 8 for this bug. 
> 
> This patch that introduces the B2G specific build flags (all enabled
> initially):
> 
> * MOZ_WEBSPEECH - Enables/Disables the STT API
> * MOZ_WEBSPEECH_MODELS - Enables/Disables the model installations
> * MOZ_WEBSPEECH_POCKETSPHINX - Compiles/Doesn't Compile Pocketsphinx,
> Sphinxbase, and relevant XPCOM models   
> 
> The try for this patch is running here
> https://treeherder.mozilla.org/#/jobs?repo=try&revision=1f8a8598fb48

This is the metabug. This patch should be included in its appropriate bug inside the tree.
(Assignee)

Updated

4 years ago
Attachment #8604626 - Flags: review?(bugs)

Updated

4 years ago
Attachment #8604626 - Attachment is obsolete: true
(Assignee)

Updated

2 years ago
Status: NEW → RESOLVED
Last Resolved: 2 years ago
Resolution: --- → WONTFIX
You need to log in before you can comment on or make changes to this bug.