Know the flaws of BERT that gave rise to its cousins!

Introduction

With the advent of the Transformer, there has been a rapid rise in language models. In 2018, BERT arrived and set new state-of-the-art results across many NLP benchmarks. Shortly afterwards, however, a long list of its cousins was born: RoBERTa, ALBERT, StructBERT, and DistilBERT, to name a few. BERT is essentially trained to optimise…