ykilcher

ykilcher

83 Followers
    Sparse is Enough in Scaling Transformers (aka Terraformer) | ML Research Paper Explained
    57:06
    ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning (Paper Explained)
    40:42
    Parameter Prediction for Unseen Deep Architectures (w/ First Author Boris Knyazev)
    48:06
    Peer Review is still BROKEN! The NeurIPS 2021 Review Experiment (results are in)
    11:09
    Learning Rate Grafting: Transferability of Optimizer Tuning (Machine Learning Research Paper Review)
    39:14
    [ML News] Cedille French Language Model | YOU Search Engine | AI Finds Profitable MEME TOKENS
    36:05
    Gradients are Not All You Need (Machine Learning Research Paper Explained)
    48:29
    [ML News] Microsoft combines Images & Text | Meta makes artificial skin | Russians replicate DALL-E
    37:52
    [ML News] Google introduces Pathways | OpenAI solves Math Problems | Meta goes First Person
    36:45
    Autoregressive Diffusion Models (Machine Learning Research Paper Explained)
    34:23
    EfficientZero: Mastering Atari Games with Limited Data (Machine Learning Research Paper Explained)
    29:25
    [YTalks] Siraj Raval - Stories about YouTube, Plagiarism, and the Dangers of Fame (Interview)
    1:06:44
    [ML News] NVIDIA GTC'21 | DeepMind buys MuJoCo | Google predicts spreadsheet formulas
    21:23
    [ML News GERMAN] NVIDIA GTC'21 | DeepMind kauft MuJoCo | Google Lernt Spreadsheet Formeln
    26:56
    I went to an AI Art Festival in Geneva (AiiA Festival Trip Report)
    18:52
    Symbolic Knowledge Distillation: from General Language Models to Commonsense Models (Explained)
    45:21
    I took a Swiss train and it was awesome! Train Seat Review - SBB InterCity 1 - Geneva to St. Gallen
    4:15
    [ML News] Microsoft trains 530B model | ConvMixer model fits into single tweet | DeepMind profitable
    27:51
    [ML News] DeepMind does Nowcasting | The Guardian's shady reporting | AI finishes Beethoven's 10th
    27:40
    Grokking: Generalization beyond Overfitting on small algorithmic datasets (Paper Explained)
    29:46
    How far can we scale up? Deep Learning's Diminishing Returns (Article Review)
    20:26
    [ML News] Plagiarism Case w/ Plot Twist | CLIP for video surveillance | OpenAI summarizes books
    30:51
    Inconsistency in Conference Peer Review: Revisiting the 2014 NeurIPS Experiment (Paper Explained)
    25:59
    [ML News] New ImageNet SOTA | Uber's H3 hexagonal coordinate system | New text-image-pair dataset
    14:13
    Does GPT-3 lie? - Misinformation and fear-mongering around the TruthfulQA dataset
    13:18