Home 5 Capabilities 5 OCR Document scanning

Doc Check

OCR - Document scanning

OCR or Optical Character Recognition is used to identify and process documents. After advanced training, an AI can recognize handwriting and numeric characters.

Document scanning / OCR

More precisely, what is OCR?

OCR is a technology that converts images (jpg, png, pdf files, etc.) containing printed, handwritten or typed text into usable digital text.

OCR makes textual content accessible for searching, editing and data extraction.

Use

How does Datakeen use OCR?

Our artificial intelligence uses optical character recognition for complementary purposes. Typing documents and extracting the information they contain.

OCR optical character recognition

Character reading

OCR correctly identifies handwritten and typescript characters.

document typing

Document typing

Identify the type of document in just a few moments by reading the title.

OCR information extraction

Information extraction

The information identified by the OCR is extracted in the form of a text file.

The power of OCR to read handwriting

Do you have handwritten documents that you'd like to read more easily? OCR is the ideal solution for you! Whether you're archiving notes, transcribing old letters or processing handwritten forms, OCR makes managing your handwritten documents a whole lot easier.

This technology makes reading and processing handwritten documents child's play.

OCR reading handwritten documents

Text processing

What happens once the information has been extracted?

Once the OCR has done its job, it's the turn of our processing tools to take over. The AI that accompanies them is able to separate the information. This enables us to build a file structured in key-values.

OCR key-value association

Key values

We have trained the Datakeen AI on many different types of document. This enables it to recognize the keys and identify the corresponding values. This is a necessary capability for reformatting occluded files.

Datakeen Classification Studio

Datakeen Studio

Your document is not known to our AI? You can still benefit from optimal processing by first training the AI on the fields you wish to extract. You can do this from the Datakeen Studio.

Results display

View your digitized document on our platform or via API

Once your document has been processed, you can view it by logging on to our platform. You can then choose to export the file in a structured format (csv, excel, etc.). You can also connect your EDM or CRM tool to our API.

Datakeen OCR platform

Using the Datakeen platform

Log on to our platform using the login details supplied by Datakeen. You'll be able to see all the processes we've carried out.

Structured file

Exporting a structured file

Go to the Datakeen platform, then to the analysis in question. An export button lets you choose the format before downloading the document.

CRM or EDM connection

Connect your EDM / CMR solution

Datakeen connects natively with certain EDM (Electronic Document Management) and CRM (Customer Relationship Management) tools.

API integration

Building an API connection

Would you like to set up an API connection? Datakeen can provide you with API keys and full documentation.

Take advantage of Datakeen OCR today

Would you like to implement a document processing solution? Contact our experts for a demonstration.

Partner France 2030   Third-party liability Provigis   Third-party liability Provigis

Our latest articles

Frequently asked questions

Document scanning

OCR and document ocrization are widely used in many applications. Common examples include :

  1. Document scanning: Convert paper documents into digital files for better management and retrieval.
  2. Character recognition on invoices and receipts: Extraction of key information for accounting and expense management.
  3. Automatic translation: OCR makes it easy to translate printed documents into another language.
  4. Archiving and document management: Storage and organization of paper documents in digital form.
  5. Accessibility: Make printed documents accessible to the visually impaired by converting text into audio format.

OCR (Optical Character Recognition) is a technology for converting physical documents or text images into editable text. Document OCR is the process of converting visual data into digital text. This means that you can take an image of a printed or handwritten document and use OCR software to extract the text from that image, enabling you to copy, edit or search it just like any other text.