한국어 정보검색에서 N-GRAM 이용한 미등록어 색인 방법 [韩语论文]-外语论文网

When indexing korean document for information retrieval, the general practice is to index nouns using phrase and morpheme analysis. However, difficulties lie in indexing those unknown words in the dictionary, a commonly used reference tool for morpheme analysis. Such unknown words can include proper nouns, borrowed words, and professional terms, and they can be a key index for information retrieval.

The N-GRAM, with its non-linguistic features, is characterized by faster processing speed, the ability to index unknown words not listed in the morpheme dictionary, and is effective for separating compound nouns.On the other hand, it can extract unrelated index words which lead to taking up too much of memory space and can degrade search efficiency.

In order to make up for such weak points of N-GRAM, this study suggests that uninflected words and conjugated words be extracted as index words first and that N-GRAM be applied at the stage for processing unknown words. Also, experiments showed that, with the same retrieval system, application of N-GRAM to the indexing algorithm for unknown words helped it perform better than other algorithms.

，韩语论文题目，韩语论文范文

高职院校韩语系建设的几点思考	항공사의 지각된 서비스품질이 실용적	도시지역 여성결혼이민자의 재사회화
韩国电影剧本中会话含义的略论探讨	韩国跆拳道运动的文化价值观探讨	영어 문장구조에 대한 이해가 읽기와 듣
형태 초점 접근법을 활용한 한국어 대조	한·중 사동 표현의 대조 연구	깔뱅의 기도론 연구
汉韩常用颜色词对比探讨	영어권 학습자를 위한 한국어 교재 구성	모야모야 환아의 수술 후 자기효능감,
TV 포맷의 새로운 유형화 : 이야기, 놀이	중국인 학습자를 위한 한국어 거절 화행	한국과 독일의 중등교육단계에서의 진로