Projects

Grizzen is a small MUD-like game server that communicates over the WebSocket protocol instead of telnet. It also differs from traditional MUDs in that the text is almost entirely generated on the client (i.e., in the browser). The server’s role is to maintain game state and pass messages between the appropriate clients, which keeps the server logic relatively simple. At this time, the game includes most of the basic commands one would expect from a MUD, as well as a rich set of social/roleplaying commands.
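To make the split between server and client concrete, here is a minimal sketch of client-side text generation. It assumes the server sends small JSON events and the client picks the wording. The event fields (`type`, `actor`, `text`), the `TEMPLATES` table, and `render_event` are all hypothetical names for illustration, not Grizzen's actual message format, and the sketch is in Python rather than browser code.

```python
import json

# Hypothetical templates: one wording for the acting player, one for onlookers.
TEMPLATES = {
    "say": {
        "self": 'You say, "{text}"',
        "other": '{actor} says, "{text}"',
    },
    "arrive": {
        "self": "You arrive.",
        "other": "{actor} arrives.",
    },
}


def render_event(raw_message, viewer):
    """Turn one structured server event into display text for one viewer."""
    event = json.loads(raw_message)
    templates = TEMPLATES.get(event["type"])
    if templates is None:
        return "[unknown event: {}]".format(event["type"])
    # The same event reads differently depending on who is looking at it.
    perspective = "self" if event["actor"] == viewer else "other"
    return templates[perspective].format(
        actor=event["actor"], text=event.get("text", "")
    )


message = json.dumps({"type": "say", "actor": "Mira", "text": "Hello there."})
print(render_event(message, viewer="Mira"))   # wording for the speaker
print(render_event(message, viewer="Tobin"))  # wording for an onlooker
```

Under this assumption the server never formats prose. It only decides which clients receive which events, which is one way the server logic could stay simple.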

Still a work in progress, LibTernary is a library for working with vectors of balanced ternary (-1, 0, 1) digits, or trits. I usually omit the “balanced” and just refer to them as ternary vectors, because ternary is usually only interesting when it’s balanced ternary. “What’s the point?” you ask? I’m glad you asked. I refer you to An Introduction to Random Indexing by Magnus Sahlgren. The upshot is that mapping text (or perhaps other data) to ternary vectors leads to a very simple and computationally fast way of approximating a vector space model à la latent semantic analysis.
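For a rough feel of the idea, here is a simplified sketch of random indexing in plain Python. It is not LibTernary's API: the names `index_vector`, `context_vectors`, and `cosine`, and the values of `DIM`, `NONZERO`, and the window size, are illustrative assumptions. Each word gets a sparse ternary index vector, and a word's context vector is the sum of the index vectors of its neighbours.

```python
import math
import random
from collections import defaultdict

DIM = 1000     # vector length (illustrative choice)
NONZERO = 10   # nonzero trits per index vector (illustrative choice)


def index_vector(word):
    """Sparse ternary vector for a word: a few +1s and -1s, the rest 0."""
    rng = random.Random(word)  # seeding on the word makes the vector reproducible
    vec = [0] * DIM
    for i, pos in enumerate(rng.sample(range(DIM), NONZERO)):
        vec[pos] = 1 if i % 2 == 0 else -1
    return vec


def context_vectors(sentences, window=2):
    """Sum the index vectors of each word's neighbours within the window."""
    contexts = defaultdict(lambda: [0] * DIM)
    for sentence in sentences:
        words = sentence.lower().split()
        for i, word in enumerate(words):
            lo, hi = max(0, i - window), min(len(words), i + window + 1)
            for j in range(lo, hi):
                if j == i:
                    continue
                for k, trit in enumerate(index_vector(words[j])):
                    contexts[word][k] += trit
    return contexts


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


sentences = ["the cat drinks milk", "the dog drinks water", "the car needs fuel"]
vectors = context_vectors(sentences)
print(cosine(vectors["cat"], vectors["dog"]))  # share two of three neighbours
print(cosine(vectors["cat"], vectors["car"]))  # share one of three neighbours
```

Words that appear in similar contexts end up with similar context vectors, and the only operations involved are additions of -1, 0, and 1. A real implementation would presumably pack the trits far more compactly than a Python list.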

This is a library for ad hoc tokenization of English text. Extract tokens from a body of text for use with NLP tools or statistical analysis. The library currently works well with most simple cases. I do plan to extend it a bit for flexibility, but not at significant cost to performance. There are other tokenizers out there with more bells and whistles, plus better multilingual support.

Usage

To use, simply pass your input string to the TextTokenizer constructor and then call the Tokenize() method.
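The call pattern can be illustrated with a small stand-in written in Python. This is a hypothetical sketch, not the library's code: the class below only mirrors the constructor-then-tokenize shape described above (with a lowercase `tokenize()` to follow Python naming), and its regular expression is an assumed rule for "simple cases", not the library's actual tokenization logic.

```python
import re


class TextTokenizer:
    """Hypothetical stand-in that mirrors the described call pattern."""

    # A word (letters/digits, with internal apostrophes or hyphens allowed),
    # or else any single non-space, non-alphanumeric character.
    _TOKEN = re.compile(r"[A-Za-z0-9]+(?:['-][A-Za-z0-9]+)*|[^\sA-Za-z0-9]")

    def __init__(self, text):
        self.text = text

    def tokenize(self):
        """Return the tokens of the input text in order of appearance."""
        return self._TOKEN.findall(self.text)


tokens = TextTokenizer("Don't panic: it's well-formed text, mostly.").tokenize()
print(tokens)
# ["Don't", 'panic', ':', "it's", 'well-formed', 'text', ',', 'mostly', '.']
```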
