Skip to content

pdf parsing using azure ai document intelligence #427

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 3 commits into from
Mar 18, 2025

Conversation

jwilliams-elastic
Copy link
Contributor

This is the actual notebook for an elastic search labs blog draft that is currently under review.

Blog Draft Document: https://docs.google.com/document/d/1Lz-S3pnXX1xHoovXT3u7z24abiWZKKPgTw6g6vFsUk8/edit?tab=t.0

Github Submssion: #422

Copy link

gitnotebooks bot commented Mar 17, 2025

Found 1 changed notebook. Review the changes at https://app.gitnotebooks.com/elastic/elasticsearch-labs/pull/427

@jwilliams-elastic
Copy link
Contributor Author

The automated tests are failing because the notebook requires credentials for Azure AI Document Search Intelligence and Elastic Cloud Serverless. This requirement is documented in the notebook.

@JessicaGarson
Copy link
Contributor

Thanks, @jwilliams-elastic. If you move this to the folder supporting-blog-content from ingestion-and-chunking, this should pass the tests.

@jwilliams-elastic
Copy link
Contributor Author

@JessicaGarson - Thanks for letting me know the right notebook location. All checks are passing now for this PR.

…. 1-increased elastic client timeout. 2-fixed issue in combine pararaph and table text
@jwilliams-elastic jwilliams-elastic merged commit 0ce41a3 into main Mar 18, 2025
2 checks passed
@jwilliams-elastic jwilliams-elastic deleted the pdf-azure-ai-document-intelligence branch March 18, 2025 15:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants