-
Notifications
You must be signed in to change notification settings - Fork 6.5k
Unable to retrieve transcript from async API #553
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@garysieling The encoding you have specified is LINEAR16 which takes a .raw file. Once, you fix it from .flac to .raw it should work for you. Re: Another storage bucket - that's not supported and for that reason we have the long running operation see sample code here and proto definition here Hope it helps! |
Thanks, I converted some of the files to RAW, and now I'm getting a quota error with both my files and the examples provided in this project. What puzzles me is that is the Quotas screen doesn't show me near any limits. D:\projects\python-docs-samples\speech\api-client>python transcribe_async.py gs://sieling-wav/10000.raw |
You are using the rest sample which doesn't take the gcs location. Can you please use grpc async sample. |
Thanks, I tried that as well but I should have included the error: D:\projects>cd python-docs-samples\speech\grpc D:\projects\python-docs-samples\speech\grpc>python transcribe_async.py gs://sieling-wav/10000.raw --encoding LINEAR16 During handling of the above exception, another exception occurred: Traceback (most recent call last): |
@garysieling Are you getting the same error in the file which has been provided resources/quit.raw? I just want to confirm that you have followed these limits? https://cloud.google.com/speech/limits |
@garysieling - I'm guessing you authenticated by running |
Yes, I used a service account. I uploaded the "quit.raw" file to my bucket as @puneith recommended, and was able to transcribe that particular file. I verified that the files I've uploaded can be played (on my machine) and are within the limits specified in Google's docs. At this point I think the service is having issues with my files, but I can't find any way to tell what the problem could be, so I'm leaning toward using a different service. |
@garysieling we would like to figure out why its not working for you. Can you please send us the files you are trying to transcribe. Also, on the limits I also meant the API usage limits. |
I can do that, what's the easiest way for me to send these? On Tue, Oct 11, 2016 at 3:04 PM, puneith notifications@github.com wrote:
Gary Sieling |
You can either attach them here or send us gcs links (make sure you have made it public) if you are ok with that. |
Alright, I've chosen a 6 minute file to test, as that should not hit any limits. The source file is an MP3, which I converted to WAV for a different service, and then to flac using sox. I uploaded all three so that you can compare. I verified that the flac file can be played with VLC. These are set to public: gs://gsieling-flac/14823.flac Here is the command I used to do the flac conversion, which I obtained from Google's documentation (https://cloud.google.com/speech/support): In previous attempts, I also used "-v 0.98", but I found that I still have issues without it. I'm running this on Windows, tried both through git bash and through the normal command line. The output below is from an example. This is the same as I reported earlier on the ticket, so this is just to prove that I have the same issue for this specific file. (D:\Software\Anaconda2) d:\projects\python-docs-samples\speech\grpc>python transcribe_async.py --encoding LINEAR16 gs://gsieling-flac/14823.flac Waiting for server processing... |
@garysieling There is one error in your command. Please follow steps below:
Input File : '14823.flac'
I ran it on my machine and got the output. Please, note this is a big audio file and it takes a bit to get the output. See output attached. |
For reference, I believe you're hitting the fact that asynchronous transcribes work only on raw files, not flac, at the moment. |
* chore(deps): update all dependencies to v2.15.1 * revert Co-authored-by: Anthonios Partheniou <partheniou@google.com>
* chore(deps): update all dependencies to v2.15.1 * revert Co-authored-by: Anthonios Partheniou <partheniou@google.com>
* chore(deps): update all dependencies to v2.15.1 * revert Co-authored-by: Anthonios Partheniou <partheniou@google.com>
I converted my files from MP3s to FLAC, and uploaded them to a bucket.
python transcribe_async.py --encoding LINEAR16 --sample_rate 16000 gs://sieling-wav/10000.flac
name: "3946981841087526250"
Waiting for server processing...
Waiting for server processing...
etc, for some time.
{"metadata": {"startTime": "2016-10-05T00:05:38.830764Z", "progressPercent": 100, "@type": "type.googleapis.com/google.cloud.speech.v1beta1.AsyncRecognizeMetadata", "lastUpdateTime": "2016-10-05T00:06:22.396557Z"}, "name": "6946489939306634899", "done": true, "response": {"@type": "type.googleapis.com/google.cloud.speech.v1beta1.AsyncRecognizeResponse"}}
It appears that the API is doing the work (it takes 10-30 seconds to be marked as complete). I know this probably isn't in your power to change, but it would be nice if you could just store the transcript results in another storage bucket.
Is there some form of debugging I can enable to get you more information?
The text was updated successfully, but these errors were encountered: