شماره ركورد
16921
عنوان
تطبيق خودكار سوالات متداول و بازيابي پاسخ با استفاده از مدلهاي NLP
سال تحصيل
1402
استاد راهنما
دكترحسن نادري
چکيده
FAQ Retrieval systems are essential for providing quick access to commonly requested information for educational and commercial organisations and for providing customer support. The current methods of lexical retrieval (such as rule-based systems and term frequency models) may provide efficient processing systems; however, they do not have the ability to process paraphrased, multilingual, and semantically complex queries from users. The introduction of dense embeddings and the use of transformer-based models have improved the ability of the systems to find the appropriate semantic match with questions. However, many of the retrieval systems currently available today still face challenges of having a high computational cost, only trained on small language-specific datasets, and provide inadequate support for Cross-lingual retrieval, especially for less common Languages like Arabic.
An overview of the most advanced techniques for finding frequently asked questions (FAQs) is included in this seminar. In addition to giving a description of the newest techniques being utilized in the retrieval of FAQs, such as traditional lexical methodologies, dense semantic retrieval, and hybrid transformer approaches, a detailed description of a new system for Arabic-English FAQ retrieval that is based on the dense embedding of questions and answers is included. This system utilizes pre-trained multilingual models, including Sentence-BERT, and the open-source dataset for WebFAQs to efficiently create a common semantic space for both questions and FAQs. By using cosine similarity to efficiently calculate similarity, and FAISS for indexing, this system provides an efficient way to perform real-time cross-lingual retrieval of FAQs without requiring either excessive training in a specific domain or extensive training on a large quantity of documents. The focus of the developed framework is to balance between computational efficiency and the strength of the semantic representation of multilingual FAQs. Finally, the workshop provides an overview of the anticipated benefits and limitations of this new system, and discusses potential directions for further research on scalable multilingual systems for the retrieval of FAQs.
نام دانشجو
ود العامري
تاريخ ارائه
2/18/2026 12:00:00 AM
متن كامل
89867
پديد آورنده
ود العامري
تاريخ ورود اطلاعات
1404/12/04
عنوان به انگليسي
Automatic FAQ Matching and Answer Retrieval Using NLP Models
كليدواژه هاي لاتين
Natural Language Processing (NLP) , Automatic Question Matching , Answer Retrieval , Question-Answering Systems (QA) , Deep Learning Models