Skip to main content

My Projects

Table of Contents

Currently Working On
#

  • JSGF library for basic communicative tasks
  • Document preprocessing (chunking, conversion, metadata, etc) cli
  • Data labeling tui
  • Semantic/knowledge graph tools
  • Golang rewrites/APIs for existing projects

My Personal Favorites
#





Skill Showcase
#

Bayesian modeling, MCMC methods
Data analysis
Clustering
NLP, embedding methods
Data visualization
Research, scientific writing

All Projects
#


Priors

·349 words·2 mins
An experiment combining pretrained and bag of words embedding approaches for text classification

Embs

·89 words·1 min
A project to provide tools streamlining sentence embedding or clustering techniques

Radicals

·135 words·1 min
Playing with different embedding techniques & kanji

Michelin

·155 words·1 min
Exploration of Michelin star restaurants

Manyogana

·673 words·4 mins
万葉仮名 & 漢数字 transliteration functions & RShiny app

Hanakotoba

·164 words·1 min
Exploring 花言葉 in Japanese and other literary corpora

Movies

·139 words·1 min
A dataviz/exploration dashboard with the 10,000 Movies dataset

Kyoto

·82 words·1 min
Restaurants and stations in Kyoto

Yoji

·182 words·1 min
Generating 四字熟語, a.k.a. 4-character Japanese idioms

eBook Tokenizer

·169 words·1 min
Add spaces between Japanese words in eBooks to work with Kindle WordWise