
Topics


Tools
Python
Clustering
NLP


ryancahildebrandt/topics



Explainable Topic Extraction with Decision Trees


This project contains 0% LLM-generated content

Purpose

This project collects several experiments and utilities for topic extraction, including:

  • Comparing popular topic extraction libraries on different kinds of documents
  • A simple implementation of decision trees to explain group membership for clustered texts
  • Additional utilities for explaining decision tree rules based on bag-of-words embedding features
  • Plotting utilities for derived clusters and/or their assigned topics
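
The idea of explaining cluster membership with a decision tree over bag-of-words features can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the project's actual implementation: the function names (`tokenize`, `best_split`, `extract_rules`), the binary word-presence features, the Gini criterion, and the toy texts are all assumptions made for the example.

```python
from collections import Counter


def tokenize(text):
    """Bag-of-words presence features: the set of lowercase tokens in a text."""
    return set(text.lower().split())


def gini(labels):
    """Gini impurity of a list of cluster labels (0.0 means a pure group)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def best_split(docs, labels):
    """Find the word whose presence/absence most reduces weighted Gini impurity."""
    vocab = sorted(set().union(*docs)) if docs else []
    best_word, best_score = None, gini(labels)
    for word in vocab:
        present = [y for d, y in zip(docs, labels) if word in d]
        absent = [y for d, y in zip(docs, labels) if word not in d]
        if not present or not absent:
            continue  # the word does not divide this group
        score = (len(present) * gini(present) + len(absent) * gini(absent)) / len(labels)
        if score < best_score:
            best_word, best_score = word, score
    return best_word, best_score


def extract_rules(docs, labels, max_depth=2, path=()):
    """Grow a small tree; return one (conditions, majority_cluster, n_docs) per leaf."""
    word, _ = best_split(docs, labels) if max_depth > 0 else (None, None)
    if word is None:
        majority = Counter(labels).most_common(1)[0][0]
        return [(path, majority, len(labels))]
    rules = []
    for has_word in (True, False):
        subset = [(d, y) for d, y in zip(docs, labels) if (word in d) == has_word]
        rules += extract_rules(
            [d for d, _ in subset],
            [y for _, y in subset],
            max_depth - 1,
            path + ((word, has_word),),
        )
    return rules


# Toy data: four texts that some clustering step has already split into two groups.
texts = [
    "cheap flights to paris",
    "cheap hotel deals",
    "python decision tree",
    "python clustering code",
]
clusters = [0, 0, 1, 1]

rules = extract_rules([tokenize(t) for t in texts], clusters, max_depth=1)
for conditions, cluster, n_docs in rules:
    readable = " AND ".join(
        f"contains '{w}'" if present else f"lacks '{w}'" for w, present in conditions
    )
    print(f"cluster {cluster} ({n_docs} docs): {readable}")
```

Each leaf reads as a plain-language rule such as "contains 'cheap'", which is what makes the cluster assignment explainable. A real pipeline would more likely fit a library decision tree on a vectorized document-term matrix, but the rule-reading step is the same in spirit.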

Datasets

Topic Extractors


Outputs

  • The results report outlining approach and findings
  • The outputs folder, which contains breakdowns of dataset topics, topic evaluations, and tests of group differences
  • Decision tree topic extraction implementation

Ryan Hildebrandt
Author
Data Scientist, etc.