Building a Conditional Random Fields Tagger for Academic Citations

Over the weekend, I built a system that identifies, parses and formats unstructured academic citations. The system is not running anymore, because I didn't want to pay for hosting, but the source code is still available on github.

It can take a raw string like "Wang, L.-P.; Titov, A …

Read more →

Introducing Osprey

Last week, I started work on a new open source software project whose goal is to streamline hyperparameter optimization for machine learning algorithms. The tool is called osprey, and it's available on github, pypi, and readthedocs. It integrates closely with scikit-learn.

Osprey is designed to make hyperparameter optimization as easy …

Read more →