Skip to content Skip to sidebar Skip to footer
Showing posts with the label Nlp

Scikit-learn, Add Features To A Vectorized Set Of Documents

I am starting with scikit-learn and I am trying to transform a set of documents into a format on wh… Read more Scikit-learn, Add Features To A Vectorized Set Of Documents

How To Normalize Similarity Measures From Wordnet

I am trying to calculate semantic similarity between two words. I am using Wordnet-based similarity… Read more How To Normalize Similarity Measures From Wordnet

Repeating Entity In Replacing Entity With Their Entity Label Using Spacy

Code: import spacy nlp = spacy.load('en_core_web_md') #read txt file, each string on its o… Read more Repeating Entity In Replacing Entity With Their Entity Label Using Spacy

Fine-tune Bert For Specific Domain (unsupervised)

I want to fine-tune BERT on texts that are related to a specific domain (in my case related to engi… Read more Fine-tune Bert For Specific Domain (unsupervised)

How To Automatically Label A Cluster Of Words Using Semantics?

The context is : I already have clusters of words (phrases actually) resulting from kmeans applied … Read more How To Automatically Label A Cluster Of Words Using Semantics?

Nltk Lemmatizer, Extract Meaningful Words

Currently, I am going to create a machine learning based code that automatically maps categories. I… Read more Nltk Lemmatizer, Extract Meaningful Words

Ner Training Using Spacy

When running a train on an empty NER model, should I include only labeled data (data that contain n… Read more Ner Training Using Spacy

How Can I Create My Own Corpus In The Python Natural Language Toolkit?

I have recently expanded the names corpus in nltk and would like to know how I can turn the two fil… Read more How Can I Create My Own Corpus In The Python Natural Language Toolkit?