Update: Microsoft will be moving away from UserVoice sites on a product-by-product basis throughout the 2021 calendar year. We will leverage 1st party solutions for customer feedback. Learn more here.

Azure Cognitive Services

Customer Feedback & Ideas for Azure Cognitive Services

Share your ideas for making Cognitive Services and the accompanying APIs work better for the applications you develop.


Catch up on the latest News and Updates


Share your Ideas and Feedback

To share your ideas on how we can make Cognitive Services better, click one of the categories underneath "Give Feedback" located in the sidebar menu to access the forum.


Documentation

API documentation available here. Within, you'll find:

§  Getting started samples
§  API References
§  Testing Consoles

Using one or more of the APIs as a "Free" preview?  Be sure to read our Terms of Service.

Contact Support

UserVoice is intended for product feedback. If you need product support, please contact either: Azure support (https://azure.microsoft.com/en-us/support/plans/) or ask a question on stack overflow (https://stackoverflow.com/questions/tagged/microsoft-cognitive)


Become a Cloud Design Insider!

Join Cloud Design Insiders, and help shape the future of Cognitive Services! As an insider, you’ll speak with program managers, designers & researchers, see new designs and ideas, provide feedback through surveys, and try out prototypes. Take the short survey to join the Cloud Design Insiders now, and we’ll see you in the community.


  1. We would like Form Recognizer's Supported locales of Pre-built Receipt v2.1 to add European languages.

    We would like Form Recognizer's Supported locales of Pre-built Receipt v2.1 to add European languages. Especially, French, German, Dutch, Spanish and Italian.

    11 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Vision  ·  Flag idea as inappropriate…  ·  Admin →
  2. We would like Form Recognizer's Supported locales of Pre-built "Invoices" v2.1 to add European languages.

    We would like Form Recognizer's Supported locales of Pre-built "Invoices" v2.1 to add European languages. Especially, French, German, Dutch, Spanish and Italian.

    8 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Vision  ·  Flag idea as inappropriate…  ·  Admin →
  3. Speaker diarization for more than 2 speakers

    Speaker diarization for more than 2 speakers.

    See this one: https://cognitive.uservoice.com/forums/555925-speaker-recognition/suggestions/34823824-add-support-for-speaker-diarization-for-untrained

    I dont feel this should be marked as resolved. Would expect support for at least 10 speakers. Additionally its currently really poor and switches between speaker 1 and 2 almost randomly. Please make this more intelligent. Its a deal breaker for us and I'm sure many others. Especially considering the google alternative can handle unlimited speakers and is far more accurate at identifying them.

    https://cloud.google.com/speech-to-text/docs/multiple-voices

    And no... expecting a sample to train it for each voice is not an option. We literally just need it to assign a number…

    8 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Speech  ·  Flag idea as inappropriate…  ·  Admin →
  4. After tagging an image the UI should return to the same page you left from, rather than resetting to Page 1.

    e.g. if you tag an image on page 630 of your selected Training Images as soon as you close the tagging dialog box the page view resets to Page 1. To get back to where you were tagging now requires hundreds of clicks to get back to the same page you were tagging, to continue.

    After the tagging dialog box the UI should return to the training images page you left.

    7 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Vision  ·  Flag idea as inappropriate…  ·  Admin →
  5. "Visualization of the current progress of the training" and "Function to stop training" in Custom Vision

    If the following features are available, training can be executed with Cutom Vision without any worries.


    • Visualization of the current progress of the training

    • Function to stop training

    6 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Vision  ·  Flag idea as inappropriate…  ·  Admin →
  6. Generate accurate audio clip for each utterance

    Getting an audio clip for each utterance will make it possible to generate a basis for a human-labeled transcript for training a custom model. This will make it possible to gradually improve the recognition accuracy after every "session", by checking the transcription and the corresponding audio clip and fixing the text for incorrect transcriptions.

    Additionally the audio clip can be used as a live read-back of the original audio.

    6 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech  ·  Flag idea as inappropriate…  ·  Admin →
  7. Need the new metric to check the number of characters used for text to speech on the Azure portal

    It is needed to be able to check the number of characters used for text to speech.
    Under the metrics tab on the Azure portal, we can only see the number of requests that have been made.

    6 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech  ·  Flag idea as inappropriate…  ·  Admin →
  8. Add «cheque» to prebuilt models

    It would be really useful to have a prebuilt model for cheques, which are much more bound to a standard than current prebuilt models (invoices, business cards and receipts).

    6 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Vision  ·  Flag idea as inappropriate…  ·  Admin →
  9. Magnetic Ink Character Recognition (MICR) for cheques

    Right now, Azure Form Recognizer mistakenly interpret MICR special encoding characters as regular characters, as you can see in the attached screenshot. It would be very userful that it ignore/handle those special characters.

    6 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Vision  ·  Flag idea as inappropriate…  ·  Admin →
  10. Arabic language support for Form Recognizer

    Would really love to see Form Recognizer supporting Arabic. There is a great demand in the middle east to extract content from scanned Arabic PDF's.

    I was able to extract text and tables from Arabic Digital PDF's to a large extent, but it fails miserably when it comes to Scanned PDF's or images. All it returns is a gibberish text scattered across.

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Language  ·  Flag idea as inappropriate…  ·  Admin →
  11. Improve the Speech Studio Text Editor.

    Being able to change the type, color, size and even highlighting the font with colors in the text editor, this would be very practical.

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech  ·  Flag idea as inappropriate…  ·  Admin →
  12. Add speech profiles in Speech Studio.

    Have the option of saving voice profiles for dialogues, and that these profiles include: voice, tone, rate, volume and intonation of the voice, so when you want to apply this profile, select the desired text and press the profile and that all the aforementioned values ​​apply.

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech  ·  Flag idea as inappropriate…  ·  Admin →
  13. Add support for rotating images in CustomVision.ai portal

    Allow users of the portal to rotate prediction images.

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Vision  ·  Flag idea as inappropriate…  ·  Admin →
  14. Form Recognizer - Return tables that span multple pages as one table

    It would be good to have an option where tables that span multiple pages, return as a single table item. This is especially useful where the continuation page does not contain the table headings.

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Vision  ·  Flag idea as inappropriate…  ·  Admin →
  15. Form Recognizer Handle Multiple of Same Form

    Have the AI recognize when the same form exists with multiple versions / scans within a document. For example you can currently submit a 2 page document (same form different values on page 1 and page 2) It detects multiple pages, however you only get the values from the first page / form. I propose the AI could recognize that the values on page 2 (in this example) are the start of the recognized form again, and thereby provide results for both versions.

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  General Cognitive Services Feedback  ·  Flag idea as inappropriate…  ·  Admin →
  16. Export Training dataset with labels and bounding boxes in customvision.ai

    Hi,

    Would be great to add an option, in addition to export models, to save a backup of the training set: data (images) and metadata (image id, label, bounding box)

    Thanks !

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Vision  ·  Flag idea as inappropriate…  ·  Admin →
  17. Add Go to End '>>', and Go to the Beginning '<<' action for Training Images displayed in Custom Vision.

    When there are many thousands of images tagged in the Training Images view of a Project there is no quick way to get the the end of the data set, or back to the beginning. It can take a great many clicks, 60 images at a time.

    A Go to End '>>', and Go to the Beginning '<<' action on this interface would help.

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Vision  ·  Flag idea as inappropriate…  ·  Admin →
  18. Cognitive services : To purge it completely without the future option for restore

    Behalf of CX (Sagi - sagi.kovaliov@lazlo326.com )
    Currently after deleting cognitive service I cannot recreate a service with the same name because it’s being soft deleted for 48 hours.
    I have to manually recover the account using rest api and it’s a time consuming operation in my devops activities.
    Devops engineers usually destroy and create the same resources again and again during their development process to achieve a full automation using the pipelines.
    At least the dev team can add a feature to control the soft delete option for the cognitive service account like you have for key vault or…

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  General Cognitive Services Feedback  ·  Flag idea as inappropriate…  ·  Admin →
  19. Dictionary function in Speech Studio to ignore words.

    Add the function of a dictionary to Speech Studio which allows to ignore or change the pronunciation of a word in the whole document, that is, when adding this word in the dictionary, it is not read regardless of whether it appears 100 times in the same document and not having to mark it one by one.

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech  ·  Flag idea as inappropriate…  ·  Admin →
  20. Audio Offset / Duration for Best Result on normalized words

    The JSON and/or result object needs to have the offset and duration of the whole normalized word.
    I've reviewed the JSON and it still doesn't solve the problem. I need to know the relationship of the DisplayText words to the Word Timings in the detail When the DisplayText outputs 007 and the Word Timings output "double" "oh" "seven" as 3 different words I don't know that 007 = those three words as there is no reference. There needs to be a display word reference to the audio word to track offset/duration of an underlying audio file. The only option that…

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech  ·  Flag idea as inappropriate…  ·  Admin →
← Previous 1 3 4 5
  • Don't see your idea?

Feedback and Knowledge Base