Main Page
About FCIT
History
Strategy
Administration>
Current Administration
Prevouis Administration
Organization Strucutre
Industrial Advisory Board
PhotoAlbum
Lab Guides
Departments
Computer Science
Information Technology
Information Systems
Academics
Bachelor Programs
Graduate Programs
Executive Programs
Academic Calendar
Admission
Bachelor Degree & Transferring
Admission from the Foundation Year
Transferring to the Faculty
Graduate Studies
Graduate Programs
Executive Programs
International students
Scientific Research
Research Groups and Centers
Research Interests
Distinguished Scientists Program
Faculty Journal
Researcher’s Diwaniya
Researches
Faculty and Staff
Faculty
CS Department
IT Department
IS Department
Staff
Accreditation Integration & Management System (AIM
Development and Quality Unit
Work at FCIT
Capabilities Under the Spotlight
Code of Ethics
Students
Bachelor
ِAcademic Services
Preparatory Year Courses
Students' Guide
Academic Advising
Laboratories and Facilities
Student rights and duties
Graduate
Polices and Regulations
Students' Guide
Student's Handbook
New Student Orientation
Templates of proposals and theses for masters and
Courses
CS Program
IT Program
IS Program
Alumni Registration
Students Activities
Entrepreneurship Club
Cybersecurity Club
Data Science Club
Programming Club
Partnership
Industrial partnerships
Cisco Academy
Microsoft Academy
Oracle Academy
Community Services
عربي
English
About
Admission
Academic
Research and Innovations
University Life
E-Services
Search
Faculty of Computing and Information Technology
Document Details
Document Type
:
Article In Conference
Document Title
:
Language Identification in Document Analysis (LIDA)
التعرف على اللغة في تحليل الوثائق (LIDA)
Subject
:
Language Identification of a text Arabic or English
Document Language
:
English
Abstract
:
This paper presents a technique that can be used to discriminate between texts written in Arabic script and texts written in Latin script. This technique addresses the language identification problem on the word level and on the text line level. This technique uses an algorithm for horizontal projection profiles. This paper presents a new algorithm of language identification to determine languages of a document. This approach may be used in identifying the language in many applications. These applications cover encoding of document pages, language specific web crawling, information retrieval, natural language processing, text mining, translation service bureau software, spell checking software, stemming or morphological analyzers, and knowledge management systems.
Conference Name
:
International Conference Circuits, Signals, and Systems
Publishing Year
:
1424 AH
2004 AD
Article Type
:
Article
Conference Place
:
Florida – USA
Organizing Body
:
IASTED
Added Date
:
Thursday, March 3, 2011
Researchers
Researcher Name (Arabic)
Researcher Name (English)
Researcher Type
Dr Grade
Email
كمال جمبي
Jambi, Kamal
Researcher
Doctorate
kjambi@kau.edu.sa
Files
File Name
Type
Description
29208.docx
docx
Back To Researches Page