Natural Language Processing of German texts - Part 2: Using LSTM neural-networks to predict ratings

Using a unique German data set containing ratings and comments on doctors, we build a Binary Text Classifier. In part 1 we've introduced a complete machine learning work flow that predicts ratings from comments. In this second part, we improve on our baseline by implementing a LSTM neural network model and using FastText embeddings. Using Keras for feature creation and prediction, we improve on the ability to understand comment sentiments.

more ...

Natural Language Processing of German texts - Part 1: Using machine-learning to predict ratings

Using a unique German data set containing ratings and comments on doctors, we build a Binary Text Classifier. To do so, we implement a complete machine learning work flow that predicts ratings from comments. In this first part, we start with basic methods. We go through text pre processing, feature creation (TF-IDF), classification and model optimization. Finally, we evaluate our model's ability to predict the sentiment of comments.

more ...

Interactive plots of large data sets made easy: Datashader

When plotting huge data sets using Python while keeping interactivity, Datashader is paramount. In this post, I demonstrate the abilities of this powerful and convenient library. We use an unique dataset containing a whole year of shared bike usage in Cologne to plot over a million locations on a map.

more ...

Interactive maps with Python made easy: Introducing Geoviews

Do you want to build map visualizations in Python? Look no further than GeoViews. It is not only super simple to use but also offers several interactive features that make your visualization stand out. Using geo spatial data from our bike rental data set we explore some of the possibilities.

more ...