Founded 2014 | HQ New York, NY | 300 employees (approx.) | $40M revenue (est.)

Hyperscience tries to take a more human-centric approach to document automation than its peers by balancing and recognizing the need for human expertise rather than unrealistically claiming to or attempting to automate 100% of work activities. There is no question that this is a solid and highly advanced IDP platform.

The Company

Hyperscience was founded in 2014 in Sofia, Bulgaria, and is now headquartered in New York City while maintaining a substantial presence in Sofia. Currently led by Interim CEO Charlie Newark-French, the company employs approximately 300 people and has revenues of around $40 million. It has raised $300 million to date and is at Series E funding.

The Technology

Hyperscience is best known for developing and launching a modern approach to optical character recognition (OCR). However, it does much more than provide OCR functionality. Hyperscience offers an advanced platform to extract and process unstructured information, and additionally offers a specific application for intelligent document processing (IDP).

The Hyperscience platform consists of a series of pre-defined, yet customizable, modules or “Code Blocks.” Each block contains specialized machine learning (ML) components that can be linked to create a customized end-to-end document processing solution. Some of the blocks available are standard; for example, blocks to input, classify, identify, transcribe, route, etc. Others are highly specific, such as a “Box Folder Listener” that monitors and, when needed, retrieves files from Box folders. Others simply connect to systems such as Salesforce, Pega, or Bizagi. From this still-growing library of Code Blocks, a user initially works within a module called Flow Studio. As the name suggests, this is a visual interface to build simple workflows consisting of a series of Code Blocks (see Figure 1). To give Hyperscience credit, this is an intuitive and easy-to-use design environment.

In summary, this is a core platform that typically runs on VMware (either in the cloud or on-premises) and incorporates pre-trained AI components including extensive deep learning elements alongside basic file storage. Hyperscience can integrate with a wide range of data sources via pre-built connectors and APIs. It supports a range of different languages including Korean, Arabic, and common European languages, and comes with extensive and detailed real-time reporting capabilities. Layered atop and leveraging the core platform are a range of pre-defined building blocks that can be accessed and sequenced in the Flow Studio environment. In our analysis, this is a simple and very effective approach, as traditional IDP systems can be highly complex to configure, code, and manage.

Here’s the thing, though. All IDP systems do the same thing, or at least have the same goal, which is to extract, understand, and process unstructured documents. As a result, most are, under the covers, similar both in approach and underlying technology. Hyperscience, though, differs noticeably from the pack in its conceptual approach to IDP as well as its extensive use of AI. What is this conceptual approach? Almost all modern IDP systems (indeed, all back-office AI products) involve Human in the Loop (HITL) whereby if the AI is unsure of something, it will flag it for a human to correct or clarify. Once the human does so, the system learns and uses that knowledge for future similar exceptions. Hyperscience embraces the use of HITL far more extensively than others we have seen. Rather than simply utilizing HITL to correct errors, Hyperscience is built to ensure that humans and the machine work collaboratively.

The company uses the term “Human Centered Automation,” and this is more than a catchy marketing term. Hyperscience recognizes that though AI and machines in general can work at speed, at volume, and with a high degree of accuracy, there will always be break points and limitations, for example when a machine reads handwriting. Hyperscience focused much of its early R&D on training its system through deep learning to recognize handwriting, and it does as good a job as any we have seen. But common sense tells us that no machine is going to read handwriting correctly all the time. Likewise, in many IDP use cases, a variety of different forms of documentation and language may be processed in parallel; for example, machine-generated PDFs alongside handwritten notes and human-generated documents. The Hyperscience goal is to bridge these divides and to normalize all the information into a form that is equally readable by a human and a machine. For complex document processes in areas such as insurance, healthcare, or financing, that strategy makes Hyperscience a much more approachable and human-centric system than one solely focused on machine processing. In our estimation this is a good approach that makes any potential IDP deployment less risky and easier to execute and scale than more traditional methods.

In our analysis the two obvious differentiators for Hyperscience, in what is otherwise a crowded marketplace, are these: 1) Compared to many of its competitors it is very easy to use and deploy, and 2) it does a good job with reading/decoding handwriting, typically a weak area for IDP products. The biggest differentiator, though, is its human-centric approach to automation.

Figure 1
Hyperscience UI with Code Blocks

Our Opinion

At Deep Analysis we look at a lot of IDP systems, and modern systems are leagues ahead in terms of ease of use and accuracy compared to systems of five or 10 years ago. Yet in truth, there is not a lot of difference between systems. Hyperscience is different, though, as it tries to take a more human-centric approach to document automation than its peers by balancing and recognizing the need for human expertise rather than unrealistically claiming to or attempting to automate 100% of work activities. That’s a difference that can only be seen and understood in practice, and that alone merits a closer look and demonstration.

Advice to Buyers

Hyperscience has come a long way in a relatively short time, and there is no question that this is a solid and highly advanced IDP platform. It is most suitable for larger deployments where volume, speed, and accuracy are at a premium. As such, if you are in the market for a new IDP system we recommend you consider Hyperscience on your shortlist and ask for a demonstration utilizing your own data.

SOAR Analysis


  • Accurate and scalable document capture and processing
  • Exceptionally easy to use system


  • To be the leader in IDP
  • To challenge and replace overly complex RPA deployments


  • Unseat legacy IDP vendors
  • Expand further though building more industry-specific applications


  • Raised $300 million to date
  • Blue chip customers such as TD Ameritrade

Attribution-NonCommercial-NoDerivatives 4.0 International
CC BY-NC-ND 4.0 license

We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.

Work Intelligence Market Analysis 2024-2029