Legal AI: How We Improved AI Text Classification By 30%?

Domain: AI in Legal
Services: Custom AI Solutions

LegalForce Trademarkia is one of the biggest Trademark search engines. They were looking to improve the text classification of trademark applications into 45 different primary categories and 40,000 sub-categories leveraging AI

These categories are typically manually entered by attorneys, a very slow process, making it an ideal place for automation with Artificial Intelligence and Natural Language Processing (NLP) techniques.

ai text classification
Trademarkia Search Engine

Problematic Classification

Unfortunately, Trademarkia’s existing AI text classification system which used a linear classifier performed poorly, where trademarks were grossly misclassified at the primary category level and it could not make a sensible classification at the sub-category level.

Our Solution

1. Understanding the problem

Our first step was to understand the exact solution used by our client, the size of data involved, and problems in their dataset such as sparsity issues. Upon diagnosis, we realized that a traditional classification approach was not the best way to tackle the problem due to issues in the data and the massive number of sub-categories.

2. Solution Development

Once we determined the problem, we were able to design and develop an alternative solution leveraging a highly efficient information retrieval approach using Python, Gensim, and ElasticSearch.

During development, we preprocessed the data adequately and developed a full pipeline (client IP) where any valid user input would result in a logical categorization both at the primary category and sub-category levels.

3. Evaluation & Delivery

In addition to the quantitative evaluation of our approach, to ensure that the results made sense and met the needs of our client, we manually evaluated ~50 test cases. Finally, the full solution was delivered to our client for integration.

Results

As a result of our partnership, LegalForce Trademarkia was able to see  ~30% improvement of text classification accuracy at the primary category level and was able to make automatic classification at the sub-category level which they previously were not able to. Also, because our algorithm was clear and efficient, LegalForce Trademarkia was able to easily integrate the pipeline into their workflow.

ai text classification case study improvement
Text classification accuracy improvement

See Also

Need help on a similar problem? Get in touch to speak to our experts.

Scroll to Top