
Predefined code Notebooks Python Topic modeling



I'm trying to use the predefined code in Python 2.7, but when I run the "Setup and dataset loading" code, I see this error:

No module named pyLDAvis.sklearn

Do you know how to solve this?




Setup and dataset loading 

First of all, let's load the libraries that we'll use.

This notebook requires the installation of the pyLDAvis package. See here for help with installing Python packages.

In [1]:

Populating the interactive namespace from numpy and matplotlib

ImportError                               Traceback (most recent call last)
<ipython-input-1-f0e756363047> in <module>()
10 from sklearn.decomposition import LatentDirichletAllocation,NMF
---> 11 import pyLDAvis.sklearn
12 pyLDAvis.enable_notebook()

ImportError: No module named pyLDAvis.sklearn


I fully recognize it's been over a year since you asked this question, but I just came across it, and in the spirit of information sharing: any Python error message saying "No module named ..." indicates that either the module name is misspelled or the code environment in use does not include that package.

The topic modeling plugin requires a Python environment with NLP-specific packages such as pyLDAvis. These are not included in DSS's built-in environment, which is likely the one you were using.

You (or an admin, depending on your level of permissions) will first need to create a code environment that includes the packages imported at the top of the topic modeling notebook, and then switch the notebook to use that code environment.
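
To check which imports are actually missing from the environment the notebook is running in, something like the following can be pasted into a notebook cell. This is only a minimal sketch, not part of the plugin: `missing_modules` is a hypothetical helper name, and the module list is taken from the traceback in the question, so adjust it to match the imports at the top of your notebook.

```python
import importlib
import sys


def missing_modules(names):
    """Return the subset of `names` that cannot be imported in this environment."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            # Raised when the module (or one of its parent packages) is not installed
            missing.append(name)
    return missing


# Assumption: these names come from the traceback above, not from the full notebook
required = ["sklearn.decomposition", "pyLDAvis.sklearn"]

# sys.executable shows which Python interpreter (and so which environment) is running
print("Interpreter in use: %s" % sys.executable)
print("Missing modules: %s" % missing_modules(required))
```

If the interpreter path points at the built-in environment, or the missing list is not empty, the notebook is not yet running in a code environment that has the required packages.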


