Allow limiting which blob documents are indexed based on a specific metadata value
We only want to index a subset of the documents in our blob container. To do that today, we have to maintain two separate blob containers. Similar to how you can include or exclude document types from indexing, the ability to restrict the scope of blobs based on a metadata value would help reduce our operating expenses and document management overhead. For example, we could add a new metadata name called "AzureSearch"; blobs with it set to "true" would be picked up by the indexer. Removing a document from the index would simply require changing that value, and the document would be removed from the index during the next update.
Thank you for the feedback. Please let us know if this article helps your scenario: https://docs.microsoft.com/en-us/azure/search/search-howto-indexing-azure-blob-storage#using-blob-metadata-to-control-how-blobs-are-indexed
You can specify whether a blob should be skipped during indexing by modifying that blob’s metadata.
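As a rough illustration of what modifying the metadata might look like, here is a minimal Python sketch. It assumes the metadata name is AzureSearch_Skip with the value "true", as described in the linked article, and that blob_client behaves like azure.storage.blob.BlobClient (get_blob_properties and set_blob_metadata). The helper names with_skip_flag and set_indexing_skip are hypothetical and only for illustration.

```python
from typing import Dict, Optional

# Metadata name assumed from the linked article; verify it against the docs.
SKIP_KEY = "AzureSearch_Skip"


def with_skip_flag(metadata: Optional[Dict[str, str]], skip: bool) -> Dict[str, str]:
    """Return a copy of the metadata with the skip flag set or removed."""
    updated = dict(metadata or {})
    # Remove any existing flag first, matching the name case-insensitively.
    for key in list(updated):
        if key.lower() == SKIP_KEY.lower():
            del updated[key]
    if skip:
        updated[SKIP_KEY] = "true"
    return updated


def set_indexing_skip(blob_client, skip: bool) -> Dict[str, str]:
    """Set or clear the skip flag on one blob.

    blob_client is assumed to behave like azure.storage.blob.BlobClient.
    Setting metadata replaces the whole set, so the existing values are
    read first and written back together with the change.
    """
    current = blob_client.get_blob_properties().metadata
    updated = with_skip_flag(current, skip)
    blob_client.set_blob_metadata(updated)
    return updated
```

Calling set_indexing_skip(blob_client, True) would mark the blob to be skipped, and set_indexing_skip(blob_client, False) would clear the flag again. How the indexer treats a blob whose flag was cleared, or one that was already indexed before it was flagged, is not covered by this sketch and should be checked against the article.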
Azure Search Product Team
Erez Wolf commented
This could work. What if one wants to re-add a blob so that it is indexed again (i.e., undo the exclusion)?