RecSys Challenge 2021

About

The RecSys Challenge 2021 will be organized by Politecnico di Bari, ETH Zürich, Jönköping University, and the data set will be provided by Twitter. The challenge focuses on a real-world task of tweet engagement prediction in a dynamic environment. For 2021, the challenge considers four different engagement types: Likes, Retweet, Quote, and replies.

This year's challenge brings the problem even closer to Twitter's real recommender systems by introducing latency constraints. We will also increase the data size to encourage novel methods. Also, the data density will be increased in terms of the graph where users are considered to be nodes and interactions as edges.

The goal is twofold: to predict the probability of different engagement types of a target user for a set of tweets based on heterogeneous input data while providing fair recommendations. We are conscious that multi-goal optimization considering accuracy and fairness will be particularly challenging. However, we believe that the recommendation community is nowadays mature enough to face the challenge of providing accurate and, at the same time, fair recommendations.

Participation and Data

Twitter will make available a public dataset of close to 1 billion data points, >40 million each day over 28 days. Week 1 - 3 will be used for training and week 4 for evaluation and testing. Each datapoint contains the tweet along with engagement features, user features, and tweet features.

If you are experiencing any issues, please notify the organizers.

Participation for this challenge is subject to your acceptance of these Terms & Conditions , and your successful completion of the steps required within, including the registration and approval process with the Twitter Developer Program

The Terms & Conditions already require that all submissions are accompanied by reproducible code, so that we can inspect winning solutions in detail: “Your Submission must include the source code and any related information used to derive the results contained in your Submission. The source code must be released under an open-source license (Apache 2.0). A third party should be able to use your submitted source to regenerate your results.” Furthermore, we explicitly state in our rules that NO de-anonymization or access to data from Twitter users and user behavior from the Twitter API other than that in the challenge dataset is allowed. Enriching the data with other data sources remains possible.

If any of the rules mentioned in the Terms & Conditions (and explained further above) are broken and thus discovered by the organizers in the code submission, the participant(s) that the submission belongs to will be disqualified from the competition. As mentioned in the Terms and Conditions: “Organizers reserve the right, in their sole discretion, to disqualify any participant who makes a Submission that does not meet the Requirements or is in violation of these Terms.”

Timeline top

Note: the timeline is subject to slight modifications.

When? What?
19 March, 2021 open to registration
22 March, 2021 Dataset Release & RecSys Challenge Starts

Training and validation datasets released

30 March, 2021 Evaluation server released
June 9, 2021 Validation datasets released

Test server released

June 23, 2021 RecSys Challenge ends
June 30, 2021 Announcement of the final leaderboard and winners Paper submission for RecSys Challenge Workshop
July 13, 2021
July 20, 2021
Paper Submission Due
August 10 August 31, 2021 Paper Acceptance Notifications
August 23 September 5, 2021 Camera-ready Papers Due
September 27 - October 1, 2021 Workshop taking place as part of the ACM RecSys conference.

Paper Submission Guidelines top

Submission website: EasyChair Now Active!



Workshop Program (Tentative) and Accepted Papers

The RecSys Challenge Workshop will take place on October 1st, 2021
All times are CEST
Time Session
14:00
-
14:05
Opening
14:05
-
14:30
Keynote - The 2021 RecSys Challenge Dataset. Fairness is not optional
14:30
-
15:00
Academic Leaderboard 3rd place - Team JKU-AIWarriors in the ACM Recommender Systems Challenge 2021: Lightweight XGBoost Recommendation Approach Leveraging User Features ALEXANDER KRAUCK, DAVID PENZ, MARKUS SCHEDL
15:00
-
15:30
General Leaderboard 3rd place - User Engagement Modeling with Deep Learning and Language Models MAKSIMS VOLKOVS, FELIPE PEREZ, ZHAOYUE CHENG, JIANING SUN, SAJAD NOROUZI, ANSON WONG, PAWEL JANKIEWICZ, BARUM RHO
  
15:00
-
16:00
Coffee Break
  
16:00
-
16:30
Academic Leaderboard 2nd place - Addressing the cold-start problem with a two-branch architecture for fair tweet recommendation PERE GILABERT, SANTI SEGUÍ
16:30
-
17:00
General Leaderboard 2nd place - Synerise at RecSys 2021: Twitter user engagement prediction with a fast neural model MICHAŁ DANILUK, JACEK DĄBROWSKI, BARBARA RYCHALSKA, KONRAD GOŁUCHOWSKI
17:00
-
17:30
Academic Leaderboard 1st place - Lightweight and Scalable Model for Tweet Engagements Predictions in a Resource-constrained Environment LUCA CARMINATI, GIACOMO LODIGIANI, PIETRO MALDINI, SAMUELE META, STIVEN METAJ, ARCANGELO PISA, ALESSANDRO SANVITO, MATTIA SURRICCHIO, FERNANDO BPÉREZ MAURERA, CESARE BERNARDIS, MAURIZIO FERRARI DACREMA
  
17:30
-
18:00
Coffee Break
  
18:00
-
18:30
General Leaderboard 1st place - GPU Accelerated Boosted Trees and Deep Neural Networks for Better Recommender Systems CHRIS DEOTTE, BO LIU, BENEDIKT SCHIFFERER, GILBERTO TITERICZ
18:30
-
19:00
Panel with Winners, Twitter, and Experts
19:00
-
19:10
Closing Remarks

Program Committee top

Academic Organizers top

RecSys Industry Mentors top

Advisors top