Add polaris-datainsight-doc-extract skill
loading diff…
The Polaris AI DataInsight Doc Extract workflow converts Office documents (DOCX, PPTX, XLSX, HWP, HWPX) into a structured unifiedSchema JSON with a single API call. It extracts text, tables, charts, images, shapes, equations, headers, and footers. Before using this workflow, you must issue an API key from Polaris DataInsight and register it as the environment variable POLARIS_DATAINSIGHT_API_KEY.
Manually parsing each document format is brittle and expensive to maintain. This workflow provides one consistent extraction format across multiple file types, making downstream analytics, automation, and RAG pipelines much easier.
Polaris Office DataInsight API official documentation and the documented Doc Extract workflow in the skill.