All the selected models achieved kappa values of 0.6 or higher, indicating substantial agreement, and the AUC score for classifying standard images based on the total score was 0.89. The activation map of the trained models properly reflected the structural features of the image. The time lapse for standard image classification was 0.35 second per image in full sequence method, and that of the three versions - ASM-1, ASM-2, ASM-3 - were 0.27, 0.22, and 0.20, respectively.