Time series analysis with powerful tools

There are many aspects to time series analysis. One method, Holt-Winters, uses exponential smoothing to handle trend and seasonality. If you want, you can read about it in the original paper, “Forecasting Sales by Exponentially Weighted Moving Averages” by Peter R. Winters […]
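As a rough illustration of how exponential smoothing can handle level, trend, and seasonality together, here is a minimal sketch of the additive Holt-Winters recursions in plain Python. The function name, the simple initialisation from the first two seasons, the parameter values, and the demo series are all assumptions made for this example; they are not taken from the paper or from any particular library.

```python
def holt_winters_additive(series, season_length, alpha, beta, gamma, horizon):
    """Forecast `horizon` steps ahead with additive Holt-Winters smoothing."""
    m = season_length
    if len(series) < 2 * m:
        raise ValueError("need at least two full seasons of data")

    # Assumed initialisation: level = mean of the first season,
    # trend = average per-step change between the first two seasons,
    # seasonal offsets = deviation of each first-season point from the level.
    first_mean = sum(series[:m]) / m
    second_mean = sum(series[m:2 * m]) / m
    level = first_mean
    trend = (second_mean - first_mean) / m
    seasonals = [y - first_mean for y in series[:m]]

    # Update level, trend, and the matching seasonal offset for each new point.
    for t in range(m, len(series)):
        y = series[t]
        prev_level = level
        s_old = seasonals[t % m]
        level = alpha * (y - s_old) + (1 - alpha) * (prev_level + trend)
        trend = beta * (level - prev_level) + (1 - beta) * trend
        seasonals[t % m] = gamma * (y - level) + (1 - gamma) * s_old

    # Forecast: extrapolate the trend and add the offset for that season slot.
    n = len(series)
    return [
        level + h * trend + seasonals[(n + h - 1) % m]
        for h in range(1, horizon + 1)
    ]


# Made-up quarterly series with a repeating pattern and no trend.
demo = [13, 9, 8, 10] * 3
print(holt_winters_additive(demo, season_length=4,
                            alpha=0.5, beta=0.1, gamma=0.1, horizon=4))
```

In practice the smoothing parameters are usually chosen by minimising forecast error on the historical data rather than set by hand as they are here.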

Text Analytics on Customer Reviews

When the input data is text, such as customer reviews, we cannot apply quantitative techniques without first coding the text. To experiment with text analytics, you can use a dataset such as this: https://www.kaggle.com/daishinkan002/men-women-shoes-reviews containing product reviews (of shoes). In this dataset, the variable “reviews” contains […]
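One simple way to code text into numbers is to count how often each word occurs. The sketch below uses only the Python standard library; the function name, the tokenising rule, the stopword list, and the three review strings are invented for illustration and are not drawn from the Kaggle dataset.

```python
import re
from collections import Counter


def term_counts(reviews, stopwords=frozenset()):
    """Count word occurrences across a list of review texts."""
    counts = Counter()
    for text in reviews:
        # Lowercase, then keep runs of letters and apostrophes as tokens.
        tokens = re.findall(r"[a-z']+", text.lower())
        counts.update(t for t in tokens if t not in stopwords)
    return counts


# Made-up example reviews (hypothetical, for illustration only).
sample_reviews = [
    "Great shoes, very comfortable",
    "Comfortable but too small",
    "Great fit, great price",
]
counts = term_counts(sample_reviews, stopwords={"but", "too", "very"})
print(counts.most_common(3))
```

Word counts like these can then feed ordinary quantitative techniques, for example comparing term frequencies between high and low ratings.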

Will the customer buy?

A key challenge in marketing campaigns is to target the customers who are likely to buy, and not waste time and money on those who are less likely to buy. On Kaggle, there is a dataset from a Portuguese bank’s marketing campaign. See Bank marketing campaigns dataset | […]

Predict Annual Spending in E-Commerce

Use the data available at https://www.kaggle.com/kolawale/focusing-on-mobile-app-or-website What Questions can you formulate, considering the data you have? What Conceptual reasoning do you suggest? What Analytics approach could be appropriate? What Results do you get? On Kaggle, users can submit reports on their approaches. Now have a look at one […]

Weighted Ranking – Olympics Medals

Finding “the best” can be tricky when we can value different aspects in different ways. What is best for you might not be best for me. A way to include multiple aspects when ranking a number of “contestants” is to include weights. Get data on the number of […]
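To see how the choice of weights can change who comes out “best”, here is a small sketch in Python. The function name, the team names, and the medal counts are made up for illustration; they are not real Olympic results.

```python
def weighted_ranking(medals, weights):
    """Return names ordered by weighted medal score, best first."""
    def score(counts):
        return sum(w * c for w, c in zip(weights, counts))

    return sorted(medals, key=lambda name: score(medals[name]), reverse=True)


# Made-up (gold, silver, bronze) counts for hypothetical teams.
medals = {
    "Team A": (10, 5, 2),
    "Team B": (8, 12, 9),
    "Team C": (9, 9, 9),
}

print(weighted_ranking(medals, (1, 0, 0)))  # only gold counts
print(weighted_ranking(medals, (3, 2, 1)))  # gold worth most
print(weighted_ranking(medals, (1, 1, 1)))  # every medal counts the same
```

With these invented numbers, the team that leads on gold alone drops to last place as soon as silver and bronze medals carry weight, which is exactly why the weights deserve explicit discussion.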

Finding Datasets Online

Collecting data is an important part of analysis, and an important skill of an analyst. However, sometimes we can use existing data sets, either because they already exist within our organisation, or because they are easily accessible. Working with pre-existing datasets lets us practice the other aspects of […]

Cluster Analysis of Data set

Find a data set you find interesting, and perform cluster analysis on it. The data should have at least two variables with good variation. Perform k-means clustering using R, Python, or an online tool. Some online tools are also great at explaining and visualizing what happens. Try https://datatab.net/statistics-calculator/cluster […]
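If you choose Python, the idea behind k-means can be sketched in a few lines without any libraries. This is a minimal version of the usual assign-then-update loop (Lloyd's algorithm); the function name, the choice to seed the centroids with the first k points, and the sample points are assumptions made for this example.

```python
def k_means(points, k, iterations=100):
    """Cluster points with a basic k-means loop; returns (centroids, labels)."""
    # Assumption: seed centroids with the first k points so runs are repeatable.
    centroids = [tuple(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid
        # (squared Euclidean distance).
        new_labels = [
            min(range(k),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(p, centroids[j])))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, new_labels) if lab == j]
            if members:
                new_centroids.append(
                    tuple(sum(c) / len(members) for c in zip(*members)))
            else:
                new_centroids.append(centroids[j])  # keep an empty cluster's centroid
        converged = new_centroids == centroids
        centroids, labels = new_centroids, new_labels
        if converged:
            break
    return centroids, labels


# Made-up 2-D points forming two visually separate groups.
points = [(1, 1), (1, 2), (2, 1), (8, 8), (9, 8), (8, 9)]
centroids, labels = k_means(points, k=2)
print(centroids)
print(labels)
```

Real projects would more likely use an established implementation with better initialisation; the point here is only to show what the algorithm does at each step.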

Try Machine Learning

Machine Learning is a term used for a number of different techniques where hyper-dimensional datasets can be used without prior conceptualization and definition of the relationships between variables. The “machine” will “learn” from the data. As a result, an ML model can be very complex and thus treated as […]

Predict the number of cyclists

In the city of Västerås, Sweden, pedestrians and cyclists are counted at different places around the city. A dashboard showing the counts is available online: https://data.eco-counter.com/ParcPublic//?id=4615. By looking at the calls the webpage makes, we can get the data source for the different counters, for […]

Collect Data using Interviews or Focus Groups

Challenge: Get (qualitative) data for your analysis without influencing the interviewee. Instructions: You are asked to contribute to the development of a product (choose any). Formulate a Question that can guide you! Through your own reflection or – better – group discussion, formulate the Concepts, i.e. key aspects of […]