Anomalo now delivers AI-powered monitoring of unstructured text
Anomalo, the complete data quality platform company, is expanding its platform, which monitors the quality of structured data in data warehouses and data lakes, to also monitor unstructured text. The expansion makes it possible for enterprises to discover, curate, leverage, and ingest high volumes of text data without the risk of using low-quality data. This new feature is currently in private beta.
Organizations are implementing generative AI and ingesting unstructured text for model training, fine-tuning, and retrieval-augmented generation (RAG) at a volume and velocity previously unseen, according to the company. As a result, they need to identify and resolve quality issues in that data before it is incorporated into generative AI models and affects their performance.
With Anomalo’s new unstructured text capability, documents can be curated and evaluated for data quality across a range of document and document-collection characteristics, including document length, duplicates, topics, tone, language, abusive language, PII, and sentiment.
Users can quickly evaluate the quality of a document collection and identify issues in individual documents, reducing the time needed to curate, profile, and leverage high-value unstructured text data.
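To make the idea of document-level checks concrete, here is a minimal Python sketch of three of the characteristics listed above: document length, duplicates, and PII. It is an illustration only and does not represent Anomalo's implementation or API. The function name `profile_collection`, the `min_words` threshold, and the email-only regular expression standing in for PII detection are all assumptions made for this example.

```python
import hashlib
import re
from collections import defaultdict

# Assumption: a simple email pattern stands in for PII detection.
# Real PII detection covers far more (names, phone numbers, IDs, and so on).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def normalize(text):
    """Lowercase and collapse whitespace so trivial variants hash identically."""
    return " ".join(text.lower().split())


def profile_collection(docs, min_words=5):
    """Return one report dict per document (hypothetical helper).

    Flags documents that are very short, contain an email address,
    or are exact duplicates (after normalization) of an earlier document.
    """
    seen = defaultdict(list)  # content hash -> indices of matching documents
    report = []
    for i, text in enumerate(docs):
        digest = hashlib.sha256(normalize(text).encode("utf-8")).hexdigest()
        seen[digest].append(i)
        word_count = len(text.split())
        report.append({
            "index": i,
            "word_count": word_count,
            "too_short": word_count < min_words,
            "has_email": bool(EMAIL_RE.search(text)),
            "duplicate_of": None,
        })
    # Mark every later copy as a duplicate of the first occurrence.
    for indices in seen.values():
        for j in indices[1:]:
            report[j]["duplicate_of"] = indices[0]
    return report


if __name__ == "__main__":
    sample = [
        "Quarterly revenue grew across all regions this year.",
        "quarterly  revenue grew across all regions this year.",
        "Contact jane@example.com",
    ]
    for row in profile_collection(sample):
        print(row)
```

Checks such as topic, tone, or sentiment would typically need a language model or classifier and are outside the scope of this sketch.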
“It’s been well known that higher quality data leads to better data products, including traditional dashboards and machine learning models. The same is true in the world of generative AI, where the quality of the text used to fine-tune or prompt the model via RAG could be the difference between a high performing application and one that is at best underwhelming and at worst, a privacy and compliance risk,” said Elliot Shmukler, co-founder and CEO of Anomalo. “We’re supporting data teams in using high quality data for all of their critical initiatives and, with our new unstructured text monitoring capability, to support their Generative AI efforts as well.”
Anomalo’s new unstructured text capability expands its platform, which uses AI to automatically detect data issues and understand their root causes before anyone else. This allows teams to resolve any hiccups with their data before making decisions, running operations, or powering models, according to the company.
For more information about this news, visit www.anomalo.com.