Scikit-learn, Add Features To A Vectorized Set Of Documents
I am starting with scikit-learn and I am trying to transform a set of documents into a format on which I could apply clustering and classification. I have seen the details about th
Solution 1:
You could use the DictVectorizer
for the extra categorical data and then use scipy.sparse.hstack to combine them.
Post a Comment for "Scikit-learn, Add Features To A Vectorized Set Of Documents"