Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
This browser is no longer supported.
Upgrade to Microsoft Edge to take advantage of the latest features, security updates, and technical support.
I'm using the Azure AI Document Intelligence Python SDK (azure-ai-documentintelligence) to analyze documents stored in Azure Blob Storage. My current workflow involves:
Question:
begin_analyze_document for single document analysis instead of downloading and uploading the file bytes? I'd like to provide a blob URL (with SAS token or Managed Identity) and have Document Intelligence fetch the document directly.pages="1-100") so I can analyze specific pages without splitting the PDF myself? This would allow me to process a long document in ranges without downloading/chunking.AzureBlobContentSource used in batch processing? I know begin_analyze_batch_documents supports blob sources, but it seems like overkill for single document analysis.Is it possible to directly pass the Azure Blob Storage URL/path to the Document Intelligence service instead of downloading and uploading the file content? I want to avoid the intermediate step of fetching the blob content to my application before sending it to Document Intelligence.
begin_analyze_document for single document analysis. You need to download the document and then send the bytes to the service.pages="1-100"). Therefore, you would need to split the PDF yourself into chunks before sending them for analysis.AzureBlobContentSource for single document analysis. The batch processing method begin_analyze_batch_documents is designed for multiple documents, and it is indeed more suited for batch operations rather than single document analysis.In summary, you will need to handle the downloading and chunking of documents manually before sending them to the Document Intelligence service.