Infosys Interview Question

Questions from Transformer architecture, attention machanism, BERT, ENCODER DECODER. Python Coding. Why we use Softmax layer in deep learning. Outlier, how to handle missing value. Uses of regularizer