The Korean Association of Language Sciences

전국우수 학회와 맞먹는 연구성과를 위해 학술대회와 편집/심사기능을 보다 강화하겠습니다.

논문자료실

pISSN: 1225-2522


언어과학, Vol.29 (2022)
pp.181~212

DOI : 10.14384/kals.2022.29.4.181

법률상담 도메인의 자연어이해 모델 학습을 위한 언어자원 구축 방법론

황창회

(한국외국어대학교/대학원생)

남지순

(한국외국어대학교/교수)

This study proposes a methodology for constructing linguistic resources to train Natural Language Understanding (NLU) models for the legal counseling service. A dataset based on the language resources we propose is essential for developing non-face-to-face legal services that provide information related to legal problems. The linguistic resources were constructed through a bottom-up analysis of linguistic patterns of legal expressions, background descriptions, and discourse types in online legal counseling texts. Moreover, we analyzed the hierarchical classification of keywords in existing legal service systems and newly determined 20 keywords that belong to 4 representative legal categories. Local Grammar Graphs (LGGs), effective in describing local linguistic phenomena, were adopted to describe various linguistic patterns in this domain. These local language patterns, modularized in LGG format, are converted into Finite State Transducers (FSTs) and generate datasets required for training a language model for NLU. To evaluate this processing, we trained an NLU model of the open-source chatbot architecture Rasa with our dataset. The model performance shows a 0.91 f1-score, which affirms that the linguistic resources and the methodology proposed in this study can be practically applied in developing legal counseling chatbot systems.
  자연어이해,법률 상담 시스템,목적지향 대화 시스템,챗봇,부분문법 그래프,언어자원

Download PDF list