Commit 4331ece
Add Docling document processing skill
Universal document conversion skill using Docling library. Converts PDF, DOCX,
PPTX, XLSX, HTML, images, and audio to Markdown/HTML/JSON.
Features:
- Document conversion with layout-aware parsing
- OCR support (Tesseract, EasyOCR, RapidOCR, OcrMac)
- Vision Language Model pipeline (GraniteDocling)
- Table extraction to CSV/Excel
- RAG framework integrations (LangChain, LlamaIndex, Haystack)
- Batch processing with parallel execution
Includes 5 helper scripts and 4 reference guides.
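
As an illustration of the conversion and batch-processing features listed above, here is a minimal sketch. It is not taken from the skill's scripts, whose contents are not shown in this commit view. The function names `docling_to_markdown` and `convert_batch` are hypothetical; the Docling calls (`DocumentConverter().convert(...)` and `result.document.export_to_markdown()`) follow the library's documented basic usage, and the import is deferred so the batch helper can be exercised without Docling installed.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable


def docling_to_markdown(source: str) -> str:
    """Convert one document to Markdown with Docling.

    Assumption: Docling is installed. The import is lazy so that the
    batch helper below stays usable with any other converter function.
    """
    from docling.document_converter import DocumentConverter

    result = DocumentConverter().convert(source)
    return result.document.export_to_markdown()


def convert_batch(
    sources: Iterable[str],
    convert: Callable[[str], str],
    max_workers: int = 4,
) -> tuple[dict[str, str], dict[str, str]]:
    """Run `convert` over `sources` in a thread pool (hypothetical helper).

    Returns (results, errors): successful outputs keyed by source, and
    error messages keyed by source for inputs that raised.
    """

    def attempt(src: str) -> tuple[str, str, bool]:
        # Catch per-file failures so one bad input does not abort the batch.
        try:
            return src, convert(src), True
        except Exception as exc:
            return src, f"{type(exc).__name__}: {exc}", False

    results: dict[str, str] = {}
    errors: dict[str, str] = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map yields outcomes in the same order as the inputs.
        for src, payload, ok in pool.map(attempt, sources):
            (results if ok else errors)[src] = payload
    return results, errors
```

A call such as `convert_batch(["report.pdf", "slides.pptx"], docling_to_markdown)` would return the Markdown for each file that converted and an error message for each that did not. This sketch builds a new `DocumentConverter` per file for simplicity; reusing one converter across files would likely be cheaper.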
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
1 parent 69c0b1a commit 4331ece
File tree
11 files changed, +2653 -0 lines changed
- skills/docling
  - references
  - scripts