Doc Check
OCR - Document scanning
OCR or Optical Character Recognition is used to identify and process documents. After advanced training, an AI can recognize handwriting and numeric characters.
More precisely, what is OCR?
OCR is a technology that converts images (jpg, png, pdf files, etc.) containing printed, handwritten or typed text into usable digital text.
OCR makes textual content accessible for searching, editing and data extraction.
Use
How does Datakeen use OCR?
Our artificial intelligence uses optical character recognition for complementary purposes. Typing documents and extracting the information they contain.
Character reading
OCR correctly identifies handwritten and typescript characters.
Document typing
Identify the type of document in just a few moments by reading the title.
Information extraction
The information identified by the OCR is extracted in the form of a text file.
The power of OCR to read handwriting
Text processing
What happens once the information has been extracted?
Once the OCR has done its job, it's the turn of our processing tools to take over. The AI that accompanies them is able to separate the information. This enables us to build a file structured in key-values.
Key values
We have trained the Datakeen AI on many different types of document. This enables it to recognize the keys and identify the corresponding values. This is a necessary capability for reformatting occluded files.
Datakeen Studio
Your document is not known to our AI? You can still benefit from optimal processing by first training the AI on the fields you wish to extract. You can do this from the Datakeen Studio.
Results display
View your digitized document on our platform or via API
Once your document has been processed, you can view it by logging on to our platform. You can then choose to export the file in a structured format (csv, excel, etc.). You can also connect your EDM or CRM tool to our API.
Using the Datakeen platform
Log on to our platform using the login details supplied by Datakeen. You'll be able to see all the processes we've carried out.
Exporting a structured file
Go to the Datakeen platform, then to the analysis in question. An export button lets you choose the format before downloading the document.
Connect your EDM / CMR solution
Datakeen connects natively with certain EDM (Electronic Document Management) and CRM (Customer Relationship Management) tools.
Building an API connection
Would you like to set up an API connection? Datakeen can provide you with API keys and full documentation.
Take advantage of Datakeen OCR today
Would you like to implement a document processing solution? Contact our experts for a demonstration.
Our latest articles
What is document scanning? Definition and applications
OCR is the acronym for Optical Character Recognition. A...
France 2030: Datakeen - A sovereign alternative
As part of its France 2030 program, France is exploring local alternatives to international giants such as Microsoft....
OCR & AI: document management reinvented
Document management is an essential part of most businesses and organizations. Whether it's for processing...
Frequently asked questions
Document scanning
OCR and document ocrization are widely used in many applications. Common examples include :
- Document scanning: Convert paper documents into digital files for better management and retrieval.
- Character recognition on invoices and receipts: Extraction of key information for accounting and expense management.
- Automatic translation: OCR makes it easy to translate printed documents into another language.
- Archiving and document management: Storage and organization of paper documents in digital form.
- Accessibility: Make printed documents accessible to the visually impaired by converting text into audio format.
OCR (Optical Character Recognition) is a technology for converting physical documents or text images into editable text. Document OCR is the process of converting visual data into digital text. This means that you can take an image of a printed or handwritten document and use OCR software to extract the text from that image, enabling you to copy, edit or search it just like any other text.