Skip to content Skip to sidebar Skip to footer

Scikit-learn, Add Features To A Vectorized Set Of Documents

I am starting with scikit-learn and I am trying to transform a set of documents into a format on which I could apply clustering and classification. I have seen the details about th

Solution 1:

You could use the DictVectorizer for the extra categorical data and then use scipy.sparse.hstack to combine them.


Post a Comment for "Scikit-learn, Add Features To A Vectorized Set Of Documents"