Awesome Transfer LearningΒΆ
A list of awesome papers and cool resources on transfer learning, domain adaptation and domain-to-domain translation in general! As you will notice, this list is currently mostly focused on domain adaptation (DA) and domain-to-domain translation, but donβt hesitate to suggest resources in other subfields of transfer learning. I accept pull requests.
Table of ContentsΒΆ
Tutorials and BlogsΒΆ
PapersΒΆ
Papers are ordered by theme and inside each theme by publication date (submission date for arXiv papers). If the network or algorithm is given a name in a paper, this one is written in bold before the paperβs name.
SurveysΒΆ
Deep Transfer LearningΒΆ
Transfer of deep learning models.
Fine-tuning approachΒΆ
Feature extraction (embedding) approachΒΆ
Multi-task learningΒΆ
- Learning without forgetting (2016)
Policy transfer for RLΒΆ
Few-shot transfer learningΒΆ
Meta transfer learningΒΆ
ApplicationsΒΆ
Medical imaging:ΒΆ
- Deep Convolutional Neural Networks forComputer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning (2016)
- Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning? (2017)
- Comparison of deep transfer learning strategies for digital pathology (2018)
Unsupervised Domain AdaptationΒΆ
Transfer between a source and a target domain. In unsupervised domain adaptation, only the source domain can have labels.
Adversarial methodsΒΆ
Learning a latent spaceΒΆ
- DANN: Domain-Adversarial Training of Neural Networks (2015)
- JAN: Deep Transfer Learning with Joint Adaptation Networks (2016)
- CoGAN: Coupled Generative Adversarial Networks (2016)
- DRCN: Deep Reconstruction-Classification Networks for Unsupervised Domain Adaptation (2016)
- DSN: Domain Separation Networks (2016)
- ADDA: Adaptative Discriminative Domain Adaptation (2017)
- GenToAdapt: Generate To Adapt: Aligning Domains using Generative Adversarial Networks (2017)
- WDGRL: Wasserstein Distance Guided Representation Learning for Domain Adaptation (2017)
- CyCADA: CyCADA: Cycle-Consistent Adversarial Domain Adaptation (2017)
- DIRT-T: A DIRT-T Approach to Unsupervised Domain Adaptation (2017)
- DupGAN: Duplex Generative Adversarial Network for Unsupervised Domain Adaptation (2018)
- MSTN: Learning Semantic Representations for Unsupervised Domain Adaptation (2018)
Image-to-Image translationΒΆ
- DIAT: Deep Identity-aware Transfer of Facial Attributes (2016)
- Pix2pix: Image-to-Image Translation with Conditional Adversarial Networks (2016)
- DTN: Unsupervised Cross-domain Image Generation (2016)
- SimGAN: Learning from Simulated and Unsupervised Images through Adversarial Training (2016) (2016)
- PixelDA: Unsupervised PixelβLevel Domain Adaptation with Generative Adversarial Networks (2016)
- UNIT: Unsupervised Image-to-Image Translation Networks (2017)
- CycleGAN: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks (2017)
- DiscoGAN: Learning to Discover Cross-Domain Relations with Generative Adversarial Networks (2017)
- DualGAN: DualGAN: Unsupervised Dual Learning for Image-to-Image Translation (2017)
- SBADA-GAN: From source to target and back: symmetric bi-directional adaptive GAN (2017)
- DistanceGAN: One-Sided Unsupervised Domain Mapping (2017)
- pix2pixHD: High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs (2018)
- I2I: Image to Image Translation for Domain Adaptation (2017)
- MUNIT: Multimodal Unsupervised Image-to-Image Translation (2018)
Multi-source adaptationΒΆ
- StarGAN: StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation (2017)
- XGAN: XGAN: Unsupervised Image-to-Image Translation for Many-to-Many Mappings (2017)
- BicycleGAN : Toward Multimodal Image-to-Image Translation (2017)
- Label Efficient Learning of Transferable Representations across Domains and Tasks (2017)
- ComboGAN: ComboGAN: Unrestrained Scalability for Image Domain Translation (2017)
- AugCGAN: Augmented CycleGAN: Learning Many-to-Many Mappings from Unpaired Data (2018)
- RadialGAN: RadialGAN: Leveraging multiple datasets to improve target-specific predictive models using Generative Adversarial Networks (2018)
- MADA: Multi-Adversarial Domain Adaptation (2018)
- MDAN: Multiple Source Domain Adaptation with Adversarial Learning (2018)
Temporal models (videos)ΒΆ
- Model F: Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos (2017)
- RecycleGAN: Recycle-GAN: Unsupervised Video Retargeting (2018)
- Vid2vid: Video-to-Video Synthesis (2018)
- Temporal Smoothing (TS): Everybody Dance Now (2018)
Optimal TransportΒΆ
- OT: Optimal Transport for Domain Adaptation (2015)
- Theoretical Analysis of Domain Adaptation with Optimal Transport (2016)
- JDOT: Joint distribution optimal transportation for domain adaptation (2017)
- Monge map learning: Large Scale Optimal Transport and Mapping Estimation (2017)
- JCPOT: Optimal Transport for Multi-source Domain Adaptation under Target Shift (2018)
- DeepJDOT: DeepJDOT: Deep Joint distribution optimal transport for unsupervised domain adaptation (2018)
Embedding methodsΒΆ
- Unsupervised Domain Adaptation for Zero-Shot Learning (2015)
- DAassoc : Associative Domain Adaptation (2017)
Kernel methodsΒΆ
Autoencoder approachΒΆ
Subspace LearningΒΆ
- SGF: Domain Adaptation for Object Recognition: An Unsupervised Approach (2011)
- GFK: Geodesic Flow Kernel for Unsupervised Domain Adaptation (2012)
- SA: Unsupervised Visual Domain Adaptation Using Subspace Alignment (2015)
- CORAL: Return of Frustratingly Easy Domain Adaptation (2015)
- Deep CORAL: Deep CORAL: Correlation Alignment for Deep Domain Adaptation (2016)
- ILS: Learning an Invariant Hilbert Space for Domain Adaptation (2016)
- Log D-CORAL: Correlation Alignment by Riemannian Metric for Domain Adaptation (2017)
Self-Ensembling methodsΒΆ
- MT: Self-ensembling for domain adaptation (2017)
OtherΒΆ
Semi-supervised Domain AdaptationΒΆ
All the source points are labelled, but only few target points are.
General methodsΒΆ
- da+lap-sim : Semi-Supervised Domain Adaptation with Instance Constraints (2013)
Subspace learningΒΆ
Copulas methodsΒΆ
Few-shot Supervised Domain AdaptationΒΆ
Only a few target examples are available, but they are labelled
Adversarial methodsΒΆ
- FADA: Few-Shot Adversarial Domain Adaptation (2017)
- Augmented-Cyc: Augmented Cyclic Adversarial Learning for Domain Adaptation (2018)
Embedding methodsΒΆ
Applied Domain AdaptationΒΆ
Domain adaptation applied to other fields
PhysicsΒΆ
- Learning to Pivot with Adversarial Networks (2016)
- Adversarial Domain Adaptation for Identifying Phase Transitions (2017)
- Identifying Quantum Phase Transitions with Adversarial Neural Networks (2017)
- Automated discovery of characteristic features of phase transitions in many-body localization (2017)
DatasetsΒΆ
Image-to-imageΒΆ
- MNIST vs MNIST-M vs SVHN vs Synth vs USPS: digit images
- GTSRB vs Syn Signs : traffic sign recognition datasets, transfer between real and synthetic signs.
- NYU Depth Dataset V2: labeled paired images taken with two different cameras (normal and depth)
- CelebA: faces of celebrities, offering the possibility to perform gender or hair color translation for instance
- Office-Caltech dataset: images of office objects from 10 common categories shared by the Office-31 and Caltech-256 datasets. There are in total four domains: Amazon, Webcam, DSLR and Caltech.
- Cityscapes dataset: street scene photos (source) and their annoted version (target)
- UnityEyes vs MPIIGaze: simulated vs real gaze images (eyes)
- CycleGAN datasets: horse2zebra, apple2orange, cezanne2photo, monet2photo, ukiyoe2photo, vangogh2photo, summer2winter
- pix2pix dataset: edges2handbags, edges2shoes, facade, maps
- RaFD: facial images with 8 different emotions (anger, disgust, fear, happiness, sadness, surprise, contempt, and neutral). You can transfer a face from one emotion to another.
- VisDA 2017 classification dataset: 12 categories of object images in 2 domains: 3D-models and real images.
- Office-Home dataset: images of objects in 4 domains: art, clipart, product and real-world.
Text-to-textΒΆ
- Amazon review benchmark dataset: sentiment analysis for four kinds (domains) of reviews: books, DVDs, electronics, kitchen
- ECML/PKDD Spam Filtering: emails from 3 different inboxes, that can represent the 3 domains.
- 20 Newsgroup: collection of newsgroup documents across 6 top categories and 20 subcategories. Subcategories can play the role of the domains, as describe in this article.
ResultsΒΆ
The results are indicated as the prediction accuracy (in %) in the target domain after adapting the source to the target. For the moment, they only correspond to the results given in the original papers, so the methodology may vary between each paper and these results must be taken with a grain of salt.
Digits transfer (unsupervised)ΒΆ
Sour ceTa rget | MNIS TMNI ST-M | Synt hSVH N | MNIS TSVH N | SVHN MNIS T | MNIS TUSP S | USPS MNIS T |
---|---|---|---|---|---|---|
SA | 56.9 0 | 86.4 4 | ? | 59.3 2 | ? | ? |
DANN | 76.6 6 | 91.0 9 | ? | 73.8 5 | ? | ? |
CoGA N | ? | ? | ? | ? | 91.2 | 89.1 |
DRCN | ? | ? | 40.0 5 | 81.9 7 | 91.8 0 | 73.6 7 |
DSN | 83.2 | 91.2 | ? | 82.7 | ? | ? |
DTN | ? | ? | 90.6 6 | 79.7 2 | ? | ? |
Pixe lDA | 98.2 | ? | ? | ? | 95.9 | ? |
ADDA | ? | ? | ? | 76.0 | 89.4 | 90.1 |
UNIT | ? | ? | ? | 90.5 3 | 95.9 7 | 93.5 8 |
GenT oAda pt | ? | ? | ? | 92.4 | 95.3 | 90.8 |
SBAD A-GA N | 99.4 | ? | 61.1 | 76.1 | 97.6 | 95.0 |
DAas soc | 89.4 7 | 91.8 6 | ? | 97.6 0 | ? | ? |
CyCA DA | ? | ? | ? | 90.4 | 95.6 | 96.5 |
I2I | ? | ? | ? | 92.1 | 95.1 | 92.2 |
DIRT -T | 98.7 | ? | 76.5 | 99.4 | ? | ? |
Deep JDOT | 92.4 | ? | ? | 96.7 | 95.7 | 96.4 |
LibrariesΒΆ
No good library for the moment (as far as I know). If youβre interested in a project of creating a generic transfer learning/domain adaptation library, please let me know.