Kubro™ - The Information Engine

Aggregate, filter and extract value from unstructured data (text) from the web, filings, tweets, your documents or emails.

Achieve significant time savings through automation of your data collection operations.

Gain an edge in research for investments, editorial, competitive and customer intelligence, and lead generation.

Cloud and API Delivery. AI-Powered. No Code. Any Language. Customized.

Kubro™ has been in development since 2016, originally for internal use, and has been deployed commercially with enterprise clients in the US, UK, and Asia since 2018.
Deployed for data collection and research in domains ranging from real estate and policy research to carbon and digital asset markets.
Demonstrated 30-60% time savings in enterprise use, and an edge in market intelligence with the search and alerts engine.
“It’s about taking what your team of 10 does in 10 days and helping them get it done in one day, empowering the human analysts with technology.”

The Proposition

The Edge for Your Data, Research, and Media Operations - for your proprietary database build and information products, your competitive and market intelligence systems, and lead generation.

Extract Value from Unstructured Text

Do your data, research or editorial operations rely on unstructured data inputs from the web, filings, Twitter, or multiple internal sources and email, with a lot of manual work involved?

If so, we can give your operations a major productivity edge.

You will build your databases and intel faster and save your analysts time, with fewer misses and frustrations and better pickups from local sources.

How? By automating the aggregation and filtering of unstructured information, with workflow tools for data analysis and for supervising the data operations.


Unique Specialist Focus

Kubro™ is not a generic solution but a specialist system for data and research businesses, originally developed for our own applications in real estate research.

No code. Any language. Easy to deploy yet highly configurable. Enterprise SaaS model. Serviced with a love for data. Quietly since 2017.

Use Cases

Proven commercial deployments with data companies, research and investment firms, editorial and media operations
Data Collection for Proprietary Databases
You sell proprietary data or databases. Your analysts rigorously collect information from sources on the web, Twitter, SEC filings, or emails and other documents. To build your edge, you want greater efficiency in data operations, automating manual tasks and focusing the team on value-add work.
Research and Market Intelligence
You believe in an information edge, in enhancing your competitive and market research or lead generation by capturing intel from a variety of sources. But you want the system to be customized to your specific business cases and under your team’s direct control.
Media and Editorial
Sources pump out text continuously; your overflowing inbox is only a tiny fraction of what’s available. How do you find the usable and important needle in the proverbial haystack? Your staff should concentrate on valuable work, providing the analysis and the context. We can automate the selection and the filtering.

Our Approach, Philosophy, Ethics

What makes this unique?

ENGINEERED FOR DATA, RESEARCH, AND MEDIA CLIENTS

Unlike generic platform solutions, we specialize in deployments for data companies, research firms, and media firms. Kubro™ was originally developed for the research business of our sister venture, Real Estate Foresight.

CUSTOMIZATION TO GAIN COMPETITIVE EDGE

Your competitive edge won’t come from standardized products. Hence our focus on customizing and configuring Kubro™ for each specific use case. In some domains, we partner with clients on an exclusive basis.

KUBRO™ AIDE – AI DATA EXTRACTION AND ROBO-ANALYSTS

With the integration of Large Language Models, you can achieve much greater time savings with automated extraction of specific data points from text documents. We can also create custom Robo-Analysts for you.

NO CODE, ANY LANGUAGE

Easy to deploy, test, and change without coding skills, yet flexible enough to apply more advanced AI models and Regex. Deployed in AWS. Straightforward RESTful APIs. Text analysis works in any language, including Chinese.

COMPLETE WORKFLOW

From data ingest and filtering to the supervision of the "modern data manufacturing plant", we go far beyond single-component RPA or search and crawling tools, with a more complete workflow solution and management tools.

ETHICS

We provide full transparency on system performance and observe best practices in handling publicly available information, for example regarding limits to web scraping, and we recommend that customers do the same.

How It Works - The Framework

No code, easy to deploy and test, developed with clients
SOURCES
Set up your sources of unstructured text from the web, filings, Twitter, or documents and email. Automate the ingest and aggregation - let the robots do the repetitive work.
FILTERING
Apply deterministic rules (keywords, Regex) and/or train your own AI (neural network) model to classify, tag, and filter the information.
ANALYSTS
Your (human) analysts focus on value-add work, such as cross-checks and database entry, research, content curation and publishing. You can also easily curate and publish filtered content to your clients, both manually and as robo-reports.
APIs
Feed the filtered outputs into your applications or dashboards through extensive RESTful APIs, both internally and for your clients, or deliver them to human analysts for their value-add work.
MANAGEMENT TOOLS
Set up the team, leader roles, performance metrics and tools to optimise the operations of your "data manufacturing plant", within the M.A.S.T.E.R.S. Framework: Model-Aggregate-Standardize-Tag-Evaluate-Release-Supervise.
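
For readers who want to picture the deterministic part of the FILTERING step, here is a minimal Python sketch of keyword and Regex tagging. It is an illustration only and not the product's actual implementation, which is configured without code. The `Rule` class, the `tag_document` function, the tag names, and the sample documents are all hypothetical and chosen purely for this example.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Rule:
    """Hypothetical tagging rule: a tag plus keywords and/or regex patterns."""
    tag: str
    keywords: list[str] = field(default_factory=list)
    patterns: list[str] = field(default_factory=list)

    def matches(self, text: str) -> bool:
        lowered = text.lower()
        # Keyword match: case-insensitive substring test.
        if any(kw.lower() in lowered for kw in self.keywords):
            return True
        # Regex match: any pattern found anywhere in the text.
        return any(re.search(p, text, flags=re.IGNORECASE) for p in self.patterns)


def tag_document(text: str, rules: list[Rule]) -> list[str]:
    """Return the tags of all rules that match the text, in rule order."""
    return [rule.tag for rule in rules if rule.matches(text)]


# Illustrative rules only (assumptions); a real deployment defines its own.
RULES = [
    Rule(tag="land_sale", keywords=["land auction", "land sale"]),
    Rule(tag="price_figure",
         patterns=[r"\b(?:USD|RMB)\s?\d[\d,.]*\s?(?:million|billion)\b"]),
    # Substring matching works on non-Latin scripts such as Chinese as well.
    Rule(tag="land_sale", keywords=["土地拍卖"]),
]

docs = [
    "City government announces land auction for RMB 1.2 billion site.",
    "Quarterly newsletter: office party photos.",
    "本周土地拍卖结果公布",
]

# Keep only the documents that received at least one tag.
filtered = [(d, tags) for d in docs if (tags := tag_document(d, RULES))]
```

In this sketch the first and third documents pass the filter and the second is dropped. Rules of this kind are transparent and easy to audit, which is why they are often paired with a trained classifier rather than replaced by one.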

InfoBesity Blog

Finding The Lost Research Chart Gems With LLMs...

20 August, 2024

Multimodal LLMs can certainly help sort out and search through the family photos but they can also be useful for research businesses. …

From RAGs To Regex

19 August, 2024

RAG (Retrieval Augmented Generation) continues to be all the rage in LLM deployments. …

Multimodality in LLMs might open up new opportunities for data extraction.

1 August, 2024

At Robotic Online Intelligence (ROI), we've been testing the use of multimodal LLMs for data extraction from the images of the tables (as opposed to the 'traditional' ways), charts, and other non-text formats. …
