A Working POS Tagger in TensorFlow.js

It was tricky to debug, but the full POS tagger is now working. You can find the code here. The hardest part was encoding the input and decoding the predictions on the client. The JavaScript and layout are still very basic, but sufficient for now.
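As a rough illustration of that client-side encoding and decoding step, here is a minimal sketch in plain JavaScript. It is not the code from the repository: the vocabularies, the `<PAD>` and `<UNK>` entries, `MAX_LEN`, and the function names are all assumptions made up for the example.

```javascript
// Hypothetical vocabularies. A real page would load the word and tag
// mappings that were saved when the model was trained.
const wordIndex = new Map([
  ["<PAD>", 0], ["<UNK>", 1], ["the", 2], ["cat", 3], ["sat", 4],
]);
const tagNames = ["<PAD>", "DT", "NN", "VBD"];
const MAX_LEN = 6; // assumed fixed input length expected by the model

// Turn a sentence into lowercase words plus a padded array of word ids.
function encode(sentence) {
  const words = sentence
    .toLowerCase()
    .split(/\s+/)
    .filter(Boolean)
    .slice(0, MAX_LEN);
  const ids = words.map((w) =>
    wordIndex.has(w) ? wordIndex.get(w) : wordIndex.get("<UNK>")
  );
  while (ids.length < MAX_LEN) ids.push(wordIndex.get("<PAD>"));
  return { words, ids };
}

// Index of the largest value in one row of tag probabilities.
function argmax(row) {
  let best = 0;
  for (let i = 1; i < row.length; i++) {
    if (row[i] > row[best]) best = i;
  }
  return best;
}

// Pair each real word with its most probable tag, ignoring padded positions.
function decode(words, probs) {
  return words.map((word, i) => ({ word, tag: tagNames[argmax(probs[i])] }));
}
```

With TensorFlow.js, `ids` would typically be wrapped as `tf.tensor2d([ids])` and passed to `model.predict`, and the prediction converted back to nested arrays before calling `decode`. The key point is that the word and tag mappings on the client must match the ones used in training exactly, which is where most of the debugging effort tends to go.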

I have also improved the model's accuracy to about 96%, up from the previous 92%. This involved adding a dense layer, adding dropout, and training on more data. The WSJ dataset is very large, so I am still only using a fraction of it: 50k out of 850k.

I also removed a number of non-POS tags, such as punctuation. They may be useful for training some models, but mine relies only on words.
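A filter like this could be sketched as follows. The tag set here is an assumption based on the usual Penn Treebank punctuation and empty-element tags, not the exact list I dropped, and `stripNonWordTags` is a hypothetical helper name.

```javascript
// Assumed set of Penn Treebank tags that do not label ordinary words:
// punctuation, quotes, brackets and the empty-element tag.
const NON_WORD_TAGS = new Set([
  ",", ".", ":", "``", "''", "-LRB-", "-RRB-", "-NONE-",
]);

// Keep only the [word, tag] pairs whose tag is a real part of speech.
function stripNonWordTags(taggedSentence) {
  return taggedSentence.filter(([, tag]) => !NON_WORD_TAGS.has(tag));
}

// Example: the trailing full stop is removed, the words are kept.
const cleaned = stripNonWordTags([
  ["The", "DT"], ["cat", "NN"], ["sat", "VBD"], [".", "."],
]);
```

Dropping these pairs before building the vocabularies keeps the tag set smaller, at the cost that the model never sees punctuation at prediction time either, so the client has to strip it from the input in the same way.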

Next up is to train a Named Entity Recognition model using these Parts of Speech as part of the input features. 
