docs/integrations/sources/box-data-extract.md
The Box Data Extract connector extracts data from Box, giving users access to the content of files in their Box cloud storage.
This connector helps automate workflows by using Box AI to extract data directly from documents stored in Box.
For example, a company managing lease contracts can automatically capture key details and populate their system of record. Likewise, a financial institution can extract critical data from loan application documents—such as bank statements and W-2s—and integrate it into approval workflows.
By prioritizing content intelligence, this connector unlocks new opportunities for automation and AI-driven insights.
<HideInUI>This page contains the setup guide and reference information for the Box Data Extract source connector.
</HideInUI>

You will need a Box application configured to use Client Credentials Grant (CCG). Follow this guide to complete authentication.
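
As a rough illustration of what CCG authentication involves, the sketch below builds the token request that Box's OAuth 2.0 token endpoint accepts for the client credentials grant. It uses only the Python standard library and does not send anything. The helper name `build_ccg_token_request` and the placeholder credentials are assumptions made for this example. The connector itself may authenticate differently (for instance through a Box SDK).

```python
from urllib.parse import urlencode
from urllib.request import Request

# Box's OAuth 2.0 token endpoint.
BOX_TOKEN_URL = "https://api.box.com/oauth2/token"


def build_ccg_token_request(client_id, client_secret, subject_type, subject_id):
    """Build (but do not send) a CCG token request. Hypothetical helper for illustration."""
    if subject_type not in ("user", "enterprise"):
        raise ValueError("subject_type must be 'user' or 'enterprise'")
    form = {
        "grant_type": "client_credentials",
        "client_id": client_id,
        "client_secret": client_secret,
        "box_subject_type": subject_type,
        "box_subject_id": subject_id,
    }
    return Request(
        BOX_TOKEN_URL,
        data=urlencode(form).encode("utf-8"),
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )


# Placeholder values, not real credentials.
request = build_ccg_token_request("my-client-id", "my-client-secret", "enterprise", "123456")
print(request.get_method(), request.full_url)
```

Passing the returned object to `urllib.request.urlopen` would perform the actual exchange. With valid credentials, the JSON response contains an `access_token` field.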
From your Box app configuration, take note of:

- Client ID: Your Box app client ID. Find yours in the Box app configuration.
- Client Secret: Your Box app client secret.

Decide which account will log in to Box:

- Box Subject Type: The type of user to log in as ("user" or "enterprise"). "enterprise" logs in with the application's service account. "user" logs in as a specific user, provided the app is allowed to impersonate users.
- Box Subject ID: If the subject type is "enterprise", use your enterprise ID. If the subject type is "user", use the ID of the user to log in as.

Choose which Box folder contains the files you want to process:

- Folder ID: The folder to retrieve data from.
- Recursive: Whether to read the folders recursively.

If you are using Box AI, you will also need:

- Ask AI Prompt: If using Ask AI, the prompt to send to the AI about the document.
- Extract AI Prompt: If using Extract AI, the prompt to send to the AI about the document.
- Extract Structured AI Fields: If using Extract Structured AI, the prompt to send to the AI about the document. For more information on the expected format of this prompt, check our developer documentation.

When setting up the source, enter the following: Client ID, Client Secret, Box Subject Type, Box Subject ID, Folder ID, and Recursive.

The Box Data Extract source connector supports the following sync modes:
| Feature | Supported? |
|---|---|
| Full Refresh Sync | Yes |
| Incremental Sync | No |
| SSL connection | Yes |
| Namespaces | No |
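
To make the configuration fields described earlier more concrete, here is a minimal sketch of how they could be grouped and sanity-checked before setting up the source. The class `BoxExtractConfig`, its attribute names, and the `validate` helper are hypothetical and only mirror the labels in this guide. They are not the connector's actual configuration schema.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class BoxExtractConfig:
    # Attribute names are hypothetical; they mirror the field labels in this guide.
    client_id: str
    client_secret: str
    box_subject_type: str  # "user" or "enterprise"
    box_subject_id: str
    folder_id: str
    recursive: bool = False
    # The three Box AI settings are only needed when the matching AI feature is used.
    ask_ai_prompt: Optional[str] = None
    extract_ai_prompt: Optional[str] = None
    extract_structured_ai_fields: Optional[str] = None


def validate(config):
    """Return a list of problems; an empty list means the required fields look usable."""
    problems = []
    for name in ("client_id", "client_secret", "box_subject_id", "folder_id"):
        if not getattr(config, name).strip():
            problems.append(f"{name} is required")
    if config.box_subject_type not in ("user", "enterprise"):
        problems.append("box_subject_type must be 'user' or 'enterprise'")
    return problems


# Placeholder values; "0" is the ID Box uses for an account's root folder.
config = BoxExtractConfig(
    client_id="my-client-id",
    client_secret="my-client-secret",
    box_subject_type="enterprise",
    box_subject_id="123456",
    folder_id="0",
    recursive=True,
)
print(validate(config))
```

Returning a list of problems rather than raising on the first one lets a user see everything that needs fixing in a single pass.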

Changelog:

| Version | Date | Pull Request | Subject |
|---|---|---|---|
| 0.1.13 | 2025-05-10 | 59315 | Update dependencies |
| 0.1.12 | 2025-04-26 | 58737 | Update dependencies |
| 0.1.11 | 2025-04-19 | 58294 | Update dependencies |
| 0.1.10 | 2025-04-12 | 57611 | Update dependencies |
| 0.1.9 | 2025-04-05 | 57154 | Update dependencies |
| 0.1.8 | 2025-03-29 | 56593 | Update dependencies |
| 0.1.7 | 2025-02-13 | | Adding Box Extract Structured AI stream |
| 0.1.6 | 2025-02-13 | | Adding Box Extract AI stream |
| 0.1.5 | 2025-02-13 | | Adding Box Ask AI stream |
| 0.1.4 | 2025-02-12 | | Initial release by @BoxDevRel |