while POS Tagging assigns tags to each word, NER Tagging tags the category of usage of multi-word spans. NER Tagging needs to label spans of text, which means that there is ambiguity in type. BIO Tagging BIO Tagging will tag each word: where B begins a span, I, is inside a span, and O outside a span. So tags per word still apply, but we can extract span information as well. (job - gender + gender ) = job (captial - country + country) = captial