
Radicals




ryancahildebrandt/radicals

Playing with different embedding techniques & kanji



Kanji in (Vector) Space



This project contains 0% LLM-generated content

Purpose

This is a mini project looking at different embedding techniques applied to kanji, partly to see what happens and partly to brush the dust off my Julia skills. Here, I use kanji radicals and readings to construct simple embedding vectors and overlay them with FastText semantic embeddings. The project is not intended to produce useful techniques for representing meaning in Japanese, so take the results with a grain of salt.
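To make the "simple embedding vectors" idea a bit more concrete, here is a rough sketch of one way radicals and readings could be turned into vectors: a multi-hot encoding over every radical and reading seen in the data. It is written in Python rather than Julia and is not the notebook's code. The three sample kanji, the `KANJI` table, and all function names are made up for illustration, and the FastText overlay is not shown.

```python
from math import sqrt

# Hypothetical hand-written sample rows, NOT taken from the project's dataframe.
KANJI = {
    "明": {"radicals": ["日", "月"], "readings": ["メイ"]},
    "時": {"radicals": ["日", "寺"], "readings": ["ジ"]},
    "持": {"radicals": ["扌", "寺"], "readings": ["ジ"]},
}


def build_vocab(rows, field):
    """Return a sorted list of every distinct value found under `field`."""
    return sorted({value for row in rows.values() for value in row[field]})


def multi_hot(values, vocab):
    """Return 1.0 at each vocab position present in `values`, else 0.0."""
    present = set(values)
    return [1.0 if entry in present else 0.0 for entry in vocab]


def embed(rows):
    """Concatenate the radical and reading multi-hot vectors for each kanji."""
    radical_vocab = build_vocab(rows, "radicals")
    reading_vocab = build_vocab(rows, "readings")
    return {
        kanji: multi_hot(row["radicals"], radical_vocab)
        + multi_hot(row["readings"], reading_vocab)
        for kanji, row in rows.items()
    }


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


vectors = embed(KANJI)
```

In this toy sample, 時 and 持 share a component and a reading, so `cosine(vectors["時"], vectors["持"])` comes out higher than the score for 明 and 持, which share nothing.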


Dataset

The data for this project was pulled from the following source:

  • Kanji dataframe for radicals, meanings, readings, etc

Outputs

  • The Pluto.jl notebook for the full code and clustering results
  • The main interactive plot of the novel embeddings overlaid

Ryan Hildebrandt
Author
Data Scientist, etc.