Azure Search
Azure Search is a search-as-a-service solution that allows developers to incorporate a sophisticated search experience into web and mobile applications without having to worry about the complexities of full-text search and without having to deploy, maintain or manage any infrastructure
-
Support Azure Active Directory Password Authentication in Azure Search SQL indexer
Currently, using AAD Password Authentication mode as described in https://docs.microsoft.com/en-us/azure/sql-database/sql-database-aad-authentication-configure#active-directory-password-authentication-1 fails. Please support this option.
86 votesThank you for your feedback. We’re considering this for a future release of Azure Search.
Thanks,
Elad
Azure Search Product Team -
Azure Search should be able to index Cosmos DB Table API accounts / collections
Similar to the suggestion for MongoDB support
56 votesThank you for your feedback. We’re considering this for a future release of Azure Search.
Thanks,
Liam
Azure Search Product Team -
Increase Blob Indexer file size limits
The "Blob indexer: maximum characters of content extracted from a blob" limit seems rather restrictive, 32K for free, 64K for standard and 4MB for everything else. Plus when this limit is reached all you see in the azure portal is
[{ "key": null, "message": "Truncated extracted text to 32768 characters." }]
It'd be useful to know which documents are affected.
31 votesThank you for your feedback. We’re considering this for a future release of Azure Search.
Thanks,
Luis Cabrera
Azure Search Product Team -
Add an API for Text extraction from a file (or string/stream with file content)
Indexer https://azure.microsoft.com/en-us/documentation/articles/search-howto-indexing-azure-blob-storage/ does extraction of text from a file (document).
Instead of using azure blob (extra implementation and usage charge) and delay with processing a document from there, client will have possibility to extract text from document and do mergeOrUpload content.We know when file content was changed (or a new file created) and would like to handle that our self.
30 votesThank you for your feedback. We’re considering this for a future release of Azure Search. Essentially we need a push-API before document cracking and enrichment occurs.
Thanks,
Luis Cabrera
Azure Search Product Team -
Enable indexing of gzip compressed blobs
https://stackoverflow.com/questions/46245505/indexing-gzipped-blobs-in-azure-search
I am trying to setup Azure Search over Azure Blob. The json blobs are compressed using gzip.
When I try to index the blobs as-is, I get the exception :
"Error detecting index schema from data source: "Error processing blob https://mystorageaccount.blob.core.windows.net/mycontainer/urlencodedname with content type ''. Status:UnsupportedMediaType, error:""
Looks like Azure Search blob indexer does support indexing ZIP archives (application/zip MIME type), but not gzip-compressed files.
28 votes -
Add support to Blob indexer for OpenOffice formats
Enable searching of OpenOffice documents stored in Azure Blob storage by adding support for indexing OpenOffice formats such as .odt, .ott, .oth, .odm, .ods, .ots, .odp, .odg, and .otp.
18 votesThank you for your feedback. We’re considering this for a future release of Azure Search.
Thanks,
Liam
Azure Search Product Team -
Add xpath syntax to promote out interesting values from source [XML] documents to predefined index fields
I have lots of structured blobs (XML, JSON), I would like a mechanism to promote/extract a given field so it can be present in a specified index field rather than just rely on text in the content field? Or must I still run an external process to parse and write these promoted values instead of using built-in Azure Search features and the Data Source / Import process?
13 votes -
Add support for indexing PDF Custom meta data
PDF Files allow custom meta data fields. In an archive it is good practice that the meta data is not separated from the document and hence we use the custom fields. We do however want to be able to search our archives don't want to have to write custom extractors to add the meta data to the index
11 votes -
index/crawl OneNote with deep links for results
Be able to crawl OneNote with deep links for results
10 votesThank you for your feedback. We’re considering this for a future release of Azure Search.
Thanks,
Elad
Azure Search Product Team -
Ability to identify from what data source the index document was created
While indexing multiple data sources(azure tables) into same index I'd like to identify what data source produces specific document. This will allow to reference back to the system that can show original document.
7 votes -
Support for indexing multi-line JSON files
Support for indexing multi-line JSON files (https://en.wikipedia.org/wiki/JSON_Streaming). Currently blob indexer extracts entire json file as one document.
It would be also nice to pair this with the gzip support so that it will be able to index *.json.gz files
3 votesFor gunzip support, please vote for a dedicated suggestion: https://feedback.azure.com/forums/263029-azure-search/suggestions/31432054-enabled-indexing-of-gzip-compressed-blobs
Thanks!
Azure Search team -
StorSimple
Search over file shares in StorSimple
3 votesWe are considering adding crawlers for common stores and will also be looking a best practices for other stores.
- Don't see your idea?