Abstract: Grid company archive data has the characteristics of multimodality, mass and diversity, covering structured, semi-structured and unstructured data. These data have the problems of low ...