Skip to main content

My Projects

Table of Contents

Currently Working On
#

  • JSGF library for basic communicative tasks
  • Document preprocessing (chunking, conversion, metadata, etc) cli
  • Data labeling tui
  • Semantic/knowledge graph tools
  • Golang rewrites/APIs for existing projects

My Personal Favorites
#





Skill Showcase
#

Bayesian modeling, MCMC methods
Data analysis
Clustering
NLP, embedding methods
Data visualization
Research, scientific writing

All Projects
#


C2G

·1208 words·6 mins
Condensing text corpora to context free grammars

Readings

·511 words·3 mins
Read/reading list

Aozora Reibun

·331 words·2 mins
Japanese language study email service

Cluster Benchmarks

·1305 words·7 mins
Embedding and clustering workloads in task queues

GSGF

·1993 words·10 mins
Generate natural language expressions from JSGF

NLT

·1790 words·9 mins
Natural language representations of tabular data

SimSort

·497 words·3 mins
Sorting texts by semantic similarity

Topics

·139 words·1 min
Experiments and utilities for text topic extraction using decision trees

Aozora Corpus

·497 words·3 mins
Centuries of Japanese literature, all in one convenient csv