DATA MINING CUP 2022
23rd edition: 78 Teams from 59 universities in 23 countries
23rd edition: 78 Teams from 59 universities in 23 countries
Often, consumers purchase products with certain time intervals. Knowing which products customers buy during these time intervals is essential information for retailers in order to roll out optimal promotion plans and more. For example, customers demand for perfumes to run with longer intervals than body lotion.
This year the DATA MINING CUP is dedicated to this scenario. Given a retailer’s fixed product assortment, the participating teams are to determine which products customers buy on a cyclical basis. They are then challenged to develop a model that predicts these cycles for all relevant products and customer groups.
This year’s scenario is all about Pia and Philip, a married couple. They started their new e-commerce business during the pandemic in 2020 by offering convenience goods online. They began by selling an assortment of masks and disinfectants, but quickly expanded to a wider range of various everyday commodities.
Having both a background in traditional and online retail, they are aware of how distant and impersonal online shopping can feel and, at the same time, how important customer guidance and recommendations are for long-term customer loyalty.
To differentiate themselves from the many other commodity shops, they decided to put an even more significant emphasis on personalized recommendations and offers.
One key element of this strategy is a customized weekly newsletter that personally addresses each of their clients. The newsletter includes user favorites, products similar customers liked, new additions, and special offers.
However, they quickly noticed a problem: repeated recommendations of recently purchased products. One quick workaround for this issue was implementing a filter that would exclude products from the recommendation for a fixed number of days. This, however, did not meet the high standards of Pia and Philip.
They are instead looking for a model that can reliably predict the week that a returning customer might repurchase one of their frequently purchased items.
By knowing the estimated week of replenishment, products can be added to the newsletter as a reminder, thus increasing basket sizes and profits.
Since the owners are only interested in the best possible solution, they organized a contest to benchmark competing prediction approaches.
The participating teams’ goal is to predict the user-based replenishment of a product based on historical orders and item features. Individual items and user specific orders are given for the period between 01.06.2020 and 31.01.2021. The prediction period is between 01.02.2021 and 28.02.2021, which is exactly four weeks long.
For a predefined subset of user and product combinations, the participants shall predict if and when a product will be purchased during the prediction period.
The prediction column in the “submission.csv” file must be filled accordingly.
The different columns are separated by the “|” symbol. A possible example of the solution file might look like this:
userID|itemID|prediction
12|6723|0
20|8272|1
28|9873|4
…
The solution file must match the specifications described in the Data section. Incorrect or incomplete submissions cannot be assessed.
Team Uni_Asia_Pacific_1
Asia Pacific University of Technology & Innovation, Malaysia
Prize: 2,000.00 EUR
Team Uni_Asia_Pacific_2
Asia Pacific University of Technology & Innovation, Malaysia
Prize: 500.00 EUR
Don’t miss out on any news about the DATA MINING CUP!
GK Artificial Intelligence for Retail AG uses cookies to ensure you the best experience on our website. When you browse the website you agree to our use of cookies.
OKPrivacy & SettingsWe may request cookies to be set on your device. We use cookies to let us know when you visit our websites, how you interact with us, to enrich your user experience, and to customize your relationship with our website.
Click on the different category headings to find out more. You can also change some of your preferences. Note that blocking some types of cookies may impact your experience on our websites and the services we are able to offer.
These cookies are strictly necessary to provide you with services available through our website and to use some of its features.
Because these cookies are strictly necessary to deliver the website, refuseing them will have impact how our site functions. You always can block or delete cookies by changing your browser settings and force blocking all cookies on this website. But this will always prompt you to accept/refuse cookies when revisiting our site.
We fully respect if you want to refuse cookies but to avoid asking you again and again kindly allow us to store a cookie for that. You are free to opt out any time or opt in for other cookies to get a better experience. If you refuse cookies we will remove all set cookies in our domain.
We provide you with a list of stored cookies on your computer in our domain so you can check what we stored. Due to security reasons we are not able to show or modify cookies from other domains. You can check these in your browser security settings.
We also use different external services like Google Webfonts, Google Maps, and external Video providers. Since these providers may collect personal data like your IP address we allow you to block them here. Please be aware that this might heavily reduce the functionality and appearance of our site. Changes will take effect once you reload the page.
Google Webfont Settings:
Google Map Settings:
Vimeo and Youtube video embeds:
You can read about our cookies and privacy settings in detail on our Privacy Policy Page.
Privacy