Create multiple projects and track their progress independently
Easily experiment with a new model/algorithm by training it in CleanML and comparing its performance with the other models in your project.
Create multiple projects and track their progress independently
Easily experiment with a new model/algorithm by training it in CleanML and comparing its performance with the other models in your project.
Gain insights about training & test data, distribution of annotated entities, and decide how to curate more data for better accuracy
Gain insights about training & test data, distribution of annotated entities, and decide how to curate more data for better accuracy
Analyze your data annotations, identify missing, incorrect & multi-classifications, and improve your dataset quality
Speed up and improves the annotation process with CleanML's helpful features, all from a single window
Speed up and improves the annotation process with CleanML's helpful features, all from a single window
Features for annotators include
Experiment with multiple algorithms using different libraries irrespective of them being on GPU, CPU, on-prem or cloud
Experiment with multiple algorithms using different libraries irrespective of them being on GPU, CPU, on-prem or cloud
Algorithms are easy to scale and replicate via configuration files and they are run on Docker containers
Easily experiment with a new model/algorithm by training it in CleanML and comparing its performance with the other models in your project.
Identify and fix data & data-classification issues, and perform drill-down analytics on the dataset. Gain insights about data classified across multiple categories/classes, missed classifications and anomalies in classifications.
Read moreWorkbench provides useful features including annotating text, entity renaming across records, editing content in-place, tag suggestions, auto-labeling suggestions, previous classifications and an ability to add a custom dictionary.
Read moreCleanML does data versioning by default. This helps with training reproducibility. CleanML also provides capability to compare a model training with a future version of the same model, with a model that uses a different algorithm and even with a model deployed in production.
Train and compare models of different algorithms with the same dataset. CleanML versions all the training and helps compare between versions of training and data. The ability to perform comparison of both models and data at a record level significantly increases your productivity.
Get labeling suggestions based on the trained algorithms which can assist the annotators and speed up new data annotations.
Import data in CoNLL-2003, IOB (IOB1/2, BILOU, IOBES), JSONL, and txt. Import data from UI, API, command-line and Singer Taps. Also export annotated data to multiple data formats via command-line.
Questions, best-practices and brainstorming, join us on our discord.