
AST[16] concludes that the abnormal image features extracted by the teacher-student model with the same structure are significantly similar, so they propose an asymmetric teacher student architecture to address this issue.
AST also introduces a normalized flow to avoid this problem and prevent estimation bias caused by the inconsistency of the two network structures. ???

tab 5: NF student vs CNN student.

tab1 Overview of the used datasets.
tab6 inf. time [ms]