One Person, One Model, One World: Learning Continual User Representation Without Forgetting

1) The document proposes Conure, a method for building lifelong user representation models that continually learn new tasks without forgetting previous ones. 2) Conure exploits the over-parameterization of deep models, iteratively pruning and freezing task-specific parameters to prevent catastrophic forgetting when tasks arrive sequentially. 3) Experiments on recommendation datasets show that Conure outperforms baselines and other lifelong learning methods across multiple sequential tasks without forgetting.


One Person, One Model, One World:
Learning Continual User Representation without Forgetting

SIGIR 2021
Data & Code: https://github.com/fajieyuan/SIGIR2021_Conure

Fajie Yuan (Westlake University, Tencent), Guoxiao Zhang (Tencent),
Alexandros Karatzoglou (Google Research), Joemon Jose (University of Glasgow),
Beibei Kong (Tencent), Yudong Li (Tencent)
Outline

• Motivation
• Related Work
• Conure
• Experiments
Our Motivation

A person plays many different roles in life, but these roles share commonalities such as personal tastes, habits, and preferences.

Our Focus:
Can we build a user representation model that keeps learning across all sequential tasks without forgetting?
One Person, One Model, One World.
[Figure: one person's roles across services (news, video, music and car recommendation, map, search engine, browser, and social apps). The same user may have rich clicking logs in one service (e.g., TikTok, a warm user) but no interactions in others (e.g., cold users on Amazon, new users in Ads).]
Using lifelong learning techniques to solve recommendation tasks

Key points
• Necessity and possibility: why lifelong learning for user representation (UR) learning?
• A lifelong learning paradigm that spans all tasks.
• Performance gains for tasks that have certain correlations.
Outline

• Motivation
• Related Work
• Conure
• Experiments
• Classical UR models (work well but are specific to a single task):

GRU4Rec (Hidasi et al., ICLR 2016), NextItNet (Yuan et al., WSDM 2019),
SASRec (Kang et al., ICDM 2018), DSSM (Huang et al., CIKM 2013), GRec (Yuan et al., WWW 2020)


• PeterRec (two-stage transfer learning):
• PeterRec (fine-tuning):
• Transfer learning paradigm comparisons:

(a) Standard TL  (b) PeterRec  (c) Conure  (d) MTL

Lifelong learning without parameter preservation
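To make the comparison concrete, here is a minimal sketch roughly in the spirit of PeterRec's model-patch idea: the pretrained backbone is frozen and only small inserted patch modules plus a new task head are trained, in contrast to standard fine-tuning, which updates everything. The toy backbone, bottleneck size, and task head are illustrative assumptions, not the original implementation.

```python
import torch
import torch.nn as nn

class ModelPatch(nn.Module):
    """A small residual adapter ("model patch"); the only trainable part per block."""
    def __init__(self, dim, bottleneck=8):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)
        self.up = nn.Linear(bottleneck, dim)

    def forward(self, x):
        return x + self.up(torch.relu(self.down(x)))  # residual connection

class PatchedBlock(nn.Module):
    """Wraps a frozen pretrained block and appends a trainable patch."""
    def __init__(self, block, dim):
        super().__init__()
        self.block = block
        self.patch = ModelPatch(dim)

    def forward(self, x):
        return self.patch(self.block(x))

dim = 64
# Toy "pretrained" backbone, standing in for a sequential encoder such as NextItNet.
backbone = nn.Sequential(*[nn.Sequential(nn.Linear(dim, dim), nn.ReLU()) for _ in range(4)])

# Standard fine-tuning would update (and overwrite) every backbone parameter.
full_finetune_params = sum(p.numel() for p in backbone.parameters())

# Patch-style transfer: freeze the backbone, train only the patches and a new task head.
for p in backbone.parameters():
    p.requires_grad = False
patched = nn.Sequential(*[PatchedBlock(block, dim) for block in backbone])
task_head = nn.Linear(dim, 10)  # hypothetical new-task output layer
patch_params = sum(p.numel() for m in (patched, task_head)
                   for p in m.parameters() if p.requires_grad)
print(f"full fine-tuning: {full_finetune_params} trainable params; "
      f"patch-style: {patch_params}")
```

Freezing the backbone keeps the pretrained user representation intact for the source task, which is the property Conure later extends to an arbitrary number of tasks.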


Outline

• Motivation
• Related Work
• Conure
• Experiments
• Catastrophic forgetting:

[Figure: after naive fine-tuning on a new task, both the network parameters and the last hidden vectors produced for the previous task change substantially.]
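The two panels can be read as simple drift metrics: how far the weights and the last hidden representations of task-1 inputs move when a model is naively fine-tuned on a second task. A minimal sketch of such measurements, using a toy model and random data purely for illustration:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 16))
x_task1 = torch.randn(128, 16)                      # stand-in for task-1 inputs

# Snapshot the task-1 parameters and the hidden vectors produced for task-1 inputs.
theta_old = torch.cat([p.detach().flatten().clone() for p in model.parameters()])
with torch.no_grad():
    h_old = model(x_task1)

# Naive fine-tuning on a "new task" (random targets here, just to move the weights).
opt = torch.optim.SGD(model.parameters(), lr=0.1)
for _ in range(200):
    x_new, y_new = torch.randn(64, 16), torch.randn(64, 16)
    loss = (model(x_new) - y_new).pow(2).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()

# Drift metrics: relative parameter change and hidden-vector change on old inputs.
theta_new = torch.cat([p.detach().flatten() for p in model.parameters()])
with torch.no_grad():
    h_new = model(x_task1)
print("relative parameter change    :", ((theta_new - theta_old).norm() / theta_old.norm()).item())
print("relative hidden-vector change:", ((h_new - h_old).norm() / h_old.norm()).item())
```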
• Over-parameterization:

—(1) The more parameters are pruned, the worse the model performs.
—(2) Retraining the pruned network (i.e., "pr70+retrain") quickly regains the original accuracy.
—(3) Smaller models (i.e., (b)) are also highly over-parameterized.
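These observations can be reproduced in spirit with magnitude pruning followed by masked retraining. Below is a minimal sketch, assuming a 70% prune ratio (the "pr70" setting above) and a hypothetical `data()` callable that yields (inputs, targets) batches; it is not the paper's exact procedure.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def magnitude_prune(model, ratio=0.7):
    """Zero out the smallest-magnitude weights; return binary keep-masks per layer."""
    masks = {}
    for name, p in model.named_parameters():
        if p.dim() < 2:                      # skip biases for simplicity
            continue
        k = max(1, int(p.numel() * ratio))   # number of weights to prune
        threshold = p.detach().abs().flatten().kthvalue(k).values
        masks[name] = (p.detach().abs() > threshold).float()
        p.data.mul_(masks[name])             # prune in place
    return masks

def retrain_pruned(model, masks, data, steps=100, lr=0.05):
    """Retrain only the surviving weights by re-applying the masks after each update."""
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    for _ in range(steps):
        x, y = data()                        # hypothetical batch provider
        loss = F.mse_loss(model(x), y)
        opt.zero_grad()
        loss.backward()
        opt.step()
        for name, p in model.named_parameters():
            if name in masks:
                p.data.mul_(masks[name])     # keep pruned weights at zero
    return model
```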
• Conure architecture and learning process:

Conure is conceptually very simple, easy to implement, and applicable to various sequential encoder networks.
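A schematic sketch of the per-task learning loop, under my own simplifying assumptions (a single shared weight matrix and magnitude-based importance): for each task, train the currently free weights while earlier tasks' weights stay frozen, then claim the most important free weights for the task via a binary mask and prune the rest for future tasks. Serving task t activates only the weights claimed by tasks 1..t. Class and method names here are hypothetical, not the released implementation.

```python
import torch
import torch.nn as nn

class ConureLikeLayer(nn.Module):
    """One over-parameterized weight matrix shared by all tasks via binary masks."""
    def __init__(self, dim):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(dim, dim) * 0.01)
        self.frozen = torch.zeros(dim, dim)   # union of the masks of finished tasks
        self.task_masks = []                  # one keep-mask per finished task

    def forward(self, x, task_id=None):
        if task_id is None:
            # Training the current task: frozen and free weights are both used in the
            # forward pass, but only free weights will be updated (see grad_mask).
            w = self.weight
        else:
            # Serving task `task_id`: activate only weights claimed by tasks <= task_id.
            keep = sum(self.task_masks[:task_id + 1])
            w = self.weight * (keep > 0).float()
        return x @ w

    def grad_mask(self):
        """Zero gradients of weights already claimed by earlier tasks (they stay frozen)."""
        if self.weight.grad is not None:
            self.weight.grad.mul_(1.0 - self.frozen)

    def finish_task(self, keep_ratio=0.3):
        """Claim the top free weights (by magnitude) for the current task, prune the rest."""
        free = 1.0 - self.frozen
        scores = (self.weight.detach().abs() * free).flatten()
        k = int(free.sum().item() * keep_ratio)
        mask = torch.zeros_like(scores)
        mask[scores.topk(k).indices] = 1.0
        mask = mask.view_as(self.weight)
        self.weight.data.mul_(self.frozen + mask)  # unclaimed free weights go back to zero
        self.task_masks.append(mask)
        self.frozen = self.frozen + mask
```

During training of task t one would call `layer.grad_mask()` after `loss.backward()` and before `optimizer.step()`, and `layer.finish_task()` once the task converges; serving task t then calls `layer(x, task_id=t)`.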
Outline

• Motivation
• Related Work
• Conure
• Experiments
• Datasets:
TTL: https://drive.google.com/file/d/1imhHUsivh6oMEtEW-RwVc4OsDqn-xOaP/view
ML: https://drive.google.com/file/d/1-_KmnZFaOdH11keLYVcgkf-kW_BaM266/view
• Results:

—(1) Conure largely outperforms other models on T3 because of the positive transfer from T1 and T2.
—(2) Conure, PeterRec and FineAll largely outperform SinMo because of the positive transfer from T1.
—(3) SinMoAll performs much worse on most tasks (except the last one) because of catastrophic forgetting.
• Ablation study (effect of T2 on T3):

—(1) Without training on T2, Conure shows worse results, e.g., -6.5% on TTL20%.


• Ablation study (task order):

—(1) Conure is not sensitive to the task order.


• Ablation study:

—(1) Pruning also works for the embedding layer.
—(2) Conure is not restricted to a specific sequential encoder.
—(3) Conure with a Transformer backbone works slightly better than with NextItNet.
Contributions:
—(1) Providing the first lifelong learning paradigm for user representations.
—(2) Providing insights into forgetting and redundancy issues in user representation models.
—(3) Designing Conure, the first lifelong user representation learning algorithm, which is simple and easy to implement.
—(4) Instantiating Conure with NextItNet and Transformer backbones.
—(5) Extensive experiments showing SOTA performance, with many new discoveries and insights.
Case study:
