DATA MINING CUP 2022
Be part of it!
Be part of it!
the DATA MINING CUP 2022 is over. Thanks to all participants. You are surely eagerly waiting for the official evaluation of the DATA MINING CUP 2022 as well as your results. On July 14, 2022 at 15:00 CEST this year’s DATA MINING CUP winners will be honored via Youtube Premiere.
We are looking forward to your participation.
Your DMC Team
This is the task
Often, consumers purchase products with certain time intervals. Knowing which products customers buy during these time intervals is essential information for retailers in order to roll out optimal promotion plans and more. For example, customers demand for perfumes to run with longer intervals than body lotion.
This year the DATA MINING CUP is dedicated to this scenario. Given a retailer’s fixed product assortment, the participating teams are to determine which products customers buy on a cyclical basis. They are then challenged to develop a model that predicts these cycles for all relevant products and customer groups.
This year’s scenario is all about Pia and Philip, a married couple. They started their new e-commerce business during the pandemic in 2020 by offering convenience goods online. They began by selling an assortment of masks and disinfectants, but quickly expanded to a wider range of various everyday commodities.
Having both a background in traditional and online retail, they are aware of how distant and impersonal online shopping can feel and, at the same time, how important customer guidance and recommendations are for long-term customer loyalty.
To differentiate themselves from the many other commodity shops, they decided to put an even more significant emphasis on personalized recommendations and offers.
One key element of this strategy is a customized weekly newsletter that personally addresses each of their clients. The newsletter includes user favorites, products similar customers liked, new additions, and special offers.
However, they quickly noticed a problem: repeated recommendations of recently purchased products. One quick workaround for this issue was implementing a filter that would exclude products from the recommendation for a fixed number of days. This, however, did not meet the high standards of Pia and Philip.
They are instead looking for a model that can reliably predict the week that a returning customer might repurchase one of their frequently purchased items.
By knowing the estimated week of replenishment, products can be added to the newsletter as a reminder, thus increasing basket sizes and profits.
Since the owners are only interested in the best possible solution, they organized a contest to benchmark competing prediction approaches.
The participating teams’ goal is to predict the user-based replenishment of a product based on historical orders and item features. Individual items and user specific orders are given for the period between 01.06.2020 and 31.01.2021. The prediction period is between 01.02.2021 and 28.02.2021, which is exactly four weeks long.
For a predefined subset of user and product combinations, the participants shall predict if and when a product will be purchased during the prediction period.
The prediction column in the “submission.csv” file must be filled accordingly.
The different columns are separated by the “|” symbol. A possible example of the solution file might look like this:
userID|itemID|prediction
12|6723|0
20|8272|1
28|9873|4
…
The solution file must match the specifications described in the Data section. Incorrect or incomplete submissions cannot be assessed.
Task | DMC-2022-Task 1 file(s) 7.24 MB
|
Open until June 26, 2022
Please check how many teams of your university are already taking part in the competition. The team list (scroll further down) has a search function. If there are two teams already, please don’t register but get in touch with the team leaders of the teams and join one of the teams. If you register, we will give you the contact details of the team leaders. Team members don’t need to be registered separately.
Please note:
Please provide as much data as possible with your registration. We will use your
Institutional e-mail address
Public e-mail address
You can choose, which e-mail address we are going to use for any further communication with you.
Confirmation of your registration
Please read our Conditions of Participartion and make sure your team registration meets all requirements for taking part in the international student competition for data mining.
Search for your educational institution to see if, or how many, teams are already registered. Please try to search the name in your own language, in English and abbreviations of the name.
You will also see the name of the team leader(s). If you want to join an existing team, please contact us for the contact details of the team leader.
Please select a column to search in.
Please type in something to search for.
Don’t miss out on any news about the DATA MINING CUP!
prudsys AG uses cookies to ensure you the best experience on our website. When you browse the website you agree to our use of cookies.
OKPrivacy & SettingsWe may request cookies to be set on your device. We use cookies to let us know when you visit our websites, how you interact with us, to enrich your user experience, and to customize your relationship with our website.
Click on the different category headings to find out more. You can also change some of your preferences. Note that blocking some types of cookies may impact your experience on our websites and the services we are able to offer.
These cookies are strictly necessary to provide you with services available through our website and to use some of its features.
Because these cookies are strictly necessary to deliver the website, refuseing them will have impact how our site functions. You always can block or delete cookies by changing your browser settings and force blocking all cookies on this website. But this will always prompt you to accept/refuse cookies when revisiting our site.
We fully respect if you want to refuse cookies but to avoid asking you again and again kindly allow us to store a cookie for that. You are free to opt out any time or opt in for other cookies to get a better experience. If you refuse cookies we will remove all set cookies in our domain.
We provide you with a list of stored cookies on your computer in our domain so you can check what we stored. Due to security reasons we are not able to show or modify cookies from other domains. You can check these in your browser security settings.
We also use different external services like Google Webfonts, Google Maps, and external Video providers. Since these providers may collect personal data like your IP address we allow you to block them here. Please be aware that this might heavily reduce the functionality and appearance of our site. Changes will take effect once you reload the page.
Google Webfont Settings:
Google Map Settings:
Vimeo and Youtube video embeds:
You can read about our cookies and privacy settings in detail on our Privacy Policy Page.
Privacy 2018