• Jan 04, 2024 News!IJFCC will adopt Article-by-Article Work Flow
  • Jun 03, 2024 News!Vol.13, No.2 has been published with online version.   [Click]
  • Dec 05, 2023 News!Vol.12, No.4 has been published with online version.   [Click]
General Information
    • ISSN: 2010-3751 (Print)
    • Frequency: Quarterly
    • DOI: 10.18178/IJFCC
    • Editor-in-Chief: Prof. Pascal Lorenz
    • Executive Editor: Ms. Tina Yuen
    • Abstracting/ Indexing: Crossref, Electronic Journals LibraryINSPEC(IET), Google Scholar, EBSCO, etc.
    • E-mail:  ijfcc@ejournal.net 
    • Article Processing Charge: 500 USD
Editor-in-chief

Prof. Pascal Lorenz
University of Haute Alsace, France
 
It is my honor to be the Editor-in-Chief of IJFCC. The journal publishes good papers in the field of future computer and communication. Hopefully, IJFCC will become a recognized journal among the readers in the filed of future computer and communication.

IJFCC 2016 Vol.5(1): 18-22 ISSN: 2010-3751
doi: 10.18178/ijfcc.2016.5.1.436

Character Mapping for Cross-Language

Mazin Al-Shuaili and Marco Carvalho

Abstract—Out-of-vocabulary words are a significant challenge for cross-language information retrieval. Names of people constitute a large portion of out-of-vocabulary words, as there are different methodologies to match names that are written in various languages. Some of the methods convert names to phonetic codes, such as Soundex, or transliterate names from one language to another. We propose a technique to map characters automatically from different languages into English, without human interference and without prior knowledge of the language. This technique can provide a statistical or phonetic model that can be used later for name comparisons or named transliterations into a cross-language. The method also generates Soundex codes for the source language based on English Soundex codes. We implement this technique for five languages: Arabic, Russian, Urdu, Hindi, and Persian. Five Soundex tables are provided as the result of this technique.

Index Terms—CLIR, data linkage, IR, name matching.

The authors are with the Florida Institute of Technology, Melbourne, FL 32901 USA (e-mail: malshuaili1994@my.fit.edu, mcarvalho@cs.fit.edu).

[PDF]

Cite: Mazin Al-Shuaili and Marco Carvalho, "Character Mapping for Cross-Language," International Journal of Future Computer and Communication vol. 5, no. 1, pp. 18-22, 2016.

Copyright © 2008-2024. International Journal of Future Computer and Communication. All rights reserved.
E-mail: ijfcc@ejournal.net