Tldrstory: AI-powered Understanding of Headlines and Story Text
A framework for AI-powered understanding of headlines and text content related to stories.
zero-shot-learning text-similarity similarity-search streamlit fastapi txtai transformers huggingface tldrstory natural-language-processing code demo

tldrstory is a framework for AI-powered understanding of headlines and text content related to stories. tldrstory applies zero-shot labeling over text, which allows dynamically categorizing content. This framework also builds a txtai index that enables text similarity search. A customizable Streamlit application and FastAPI backend service allows users to review and analyze the data processed.

Election 2020 Disclaimer: This is for demonstration purposes only. There will be errors based on the nuances of the English language and personal perception. Users should judge the accuracy of results for themselves. All titles and links are displayed as is from Reddit. Users agree to use their own discretion and accept responsibility for the links they click.

This application categorizes 2020 US Presidential Election Reddit post titles by topic, objectivity and potential bias towards a candidate. Data is pulled via the Reddit API using a series of queries for popular link posts. In many but not all cases, the titles of these posts are headlines for news articles.

Reddit post titles are labeled using two categories:

  • Objectivity
  • Bias towards a candidate

Generally, as the objectivity score increases, the level of bias should decrease.

Tech stack A zero-shot classifier, backed by a large general language model with no labeled data, is used to categorize topics, objectivity and potential bias. Additionally, a txtai index enables ad hoc similarity searches against the data.

The following libraries are used:

  • txtai
  • Transformers
  • Sentence Transformers
  • Streamlit
  • FastAPI

Don't forget to tag @neuml , @davidmezzetti in your comment, otherwise they may not be notified.

Authors community post
Applying machine learning to solve everyday problems
Founder/CEO at NeuML — applying machine learning to solve everyday problems. Previously co-founded and built Data Works into a successful IT services company.
Share this project
Similar projects
Generalized Zero & Few-Shot Transfer for Facial Forgery Detection
Deep Distribution Transfer (DDT), a new transfer learning approach to address the problem of zero and few-shot transfer in the context of facial forgery.
Zero-Shot Learning for Text Classification
A visual summary of “Train Once, Test Anywhere” paper for zero-shot text classification
Zero Shot Topic Classification
Bart with a classification head trained on MNLI.
Exploring Knowledge Captured in Probability of Strings
An exploration of simple knowledge captured by language models with code examples