• ISSN: 1674-7461
  • CN: 11-5823/TU
  • 主管:中国科学技术协会
  • 主办:中国图学学会
  • 承办:中国建筑科学研究院有限公司

面向公路工程规范的多粒度知识提取与知识应用方法

Multi-Level Knowledge Extraction and Application Methods for Highway engineering Specifications

  • 摘要: 针对公路工程领域知识繁多而应用效率低的问题,提出面向公路规范类文本的多粒度知识提取与知识应用方法。在词语粒度上构建了公路工程领域词库;在语段粒度上提出TEARS定义,将复杂语段转换为结构化的三元组结构;在子句粒度上总结了四种主要句法,并各自设计了语义信息的抽取方法。以967本公路规范类文本为数据源,从中提取知识并构建了公路工程领域知识图谱,通过与深度学习方法比较验证了正确性,开发公路工程安全信息检索与应用系统。结果表明:该方法实现了非结构化公路规范类文本的知识提取,构建的领域知识图谱质量较高,满足工程应用需求。

     

    Abstract: Aiming at the problem of low efficiency in searching huge domain knowledge in the field of highway engineering, a multi-level knowledge extraction method for highway engineering specifications is proposed in the present paper. In the word level, a domain lexical database of highway engineering is constructed. In the paragraph level, a TEARS definition for highway engineering specifications is proposed, therefore unstructured paragraphs can be converted into structured triples. In the sentence level, four main sentence structures and their extraction methods for semantic information are designed respectively. The research constructs a domain knowledge graph of highway engineering by using above established methods and taking 967 highway engineering specifications as data source, and further develops a highway engineering safety information searching and application system. The result shows that the proposed methods can successfully extract knowledge from highway engineering specifications, and the constructed domain knowledge graph can fully meet the needs of engineering applications.

     

/

返回文章
返回