Evaluating tokenizers with unit tests