CNNs & ViTs (Vision Transfomers) - Comparing the internal structures, Maithra Raghu, @Google | Jay Shah | Podwise