#title Charset Detecting [[TableOfContents]] = °³¿ä = * °¢ ¾ð¾îº° character set detecting À» À§ÇÑ library ¹× API Á¤¸® * Charset detecting ¿¡´Â Å©°Ô 2°¡Áö open µÈ library ¸¦ ÀÌ¿ë * [http://www.mozilla.org/projects/intl/chardet.html Mozilla Universal Charset Detect] * [http://icu-project.org/userguide/charsetDetection.html IBM International Components for Unicode Project] = ¾ð¾îº° API = || ¾ð¾î || API À̸§ || URL || || C++ || Mozilla (Original) || http://mxr.mozilla.org/mozilla-central/source/intl/chardet/src/ || || C/C++ || libchardet || http://kldp.org/node/103169 || || C# || Nchardet || http://www.conceptdevelopment.net/Localization/NCharDet/ || || Java || Jchardet || http://jchardet.sourceforge.net/ || || Python || chardet || http://chardet.feedparser.org/ || || Ruby || chardet || https://rubygems.org/search?query=chardet || || Perl || Encode-Detect || http://search.cpan.org/~jgmyers/Encode-Detect-1.01/Detector.pm || || PHP || mod_chardet || http://cvs.oops.org/?cvsroot=PHP-Module&module=mod_chardet&file=README,v&rev=1.3 || = ¾ð¾îº° API Ư¡ = == Original Mozilla Code == * C++ ±â¹Ý * browser Äڵ忡¼­ ºÐ¸®ÇØ ³»±â°¡ ½±Áö ¾Ê´Ù. * ¿äÁò ¹öÀüÀÇ °æ¿ì intl/chardet/test ¿¡ sample source °¡ ÀÖ¾î ¹«¾ð°¡ ¿©Áö°¡ ÀÖÀ» µí.. == Nchardet == * C# ¿ë API * Mozilla Universal Charset Detect Algorithm * »ìÆìº¸Áö ¾Ê¾ÒÀ½ == Jchardet == * Java ¿ë chardet class API * Mozilla Universal Charset Detect Algorithm * sourceforge ¿¡¼­ ÁøÇà Áß * ¿ª½Ã »ìÆì º¸Áö´Â ¾Ê¾ÒÀ½ == py-chardet == * Python API * Mozilla Universal Charset Detect Algorithm * python pure code ·Î ÀÛ¼º * ¼Óµµ°¡ ´À¸° ´ÜÁ¡ * ±×·¡µµ °¡Àå À¯¸íÇÔ. (°Ë»ö¿¡ °¡Àå ¸¹ÀÌ °É¸²) * °³¹ßÀÌ Áß´ÜÀÌ µÈ °Í °°°í, ¸î¸î forking versionµéÀÌ ÀÖÀ½ * https://github.com/sigmavirus24/charade * https://pypi.python.org/pypi/chardet * py-chardet c binding module * pure code·Î ÀÛ¼ºÇÑ °ÍÀÌ ´À¸° ÀÌÀ¯·Î c bindingÀ¸·Î Á¦ÀÛ * http://mirror.oops.org/pub/oops/python/chardet/ * libchardet ÀÌ ÇÊ¿ä * http://mirror.oops.org/pub/oops/libchardet/ == chardet ruby == * gemÀ¸·Î ¿©·¯°¡Áö¸¦ Áö¿øÇÑ´Ù. * https://rubygems.org/search?query=chardet ÂüÁ¶ == Encode-Detect == * Perl API * Mozilla Universal Charset Detect Algorithm * C++ binding ÀÌ¶ó¼­ ¼Óµµ°¡ ÁÁ´Ù. * ¸î¸î ¾ð¾î¿¡ ´ëÇÏ¿© ¹ö±×°¡ ÀÖ´Ù. * ¹ö±×°¡ ÀÖÀ¸³ª, python chardet º¸´Ù´Â Á¤È®µµ°¡ ³ô´Ù == libchardet == * C/C++ library ( libchardet.so/libchardet.a support) * Mozilla Universal Charset Detect Algorithm * Encode-Detect 1.01 ÀÇ C++ binding Äڵ带 ¼öÁ¤ * C wrapping API Á¦°ø * Encode-Detect ÀÇ ¹ö±× ¼öÁ¤ * python chardet/Encode-Detect º¸´Ù´Â Á¤È®µµ°¡ ³ô´Ù * 2010/07 ÇöÀç libchardet ÀÌ ¿©·¯°³°¡ Á¸ÀçÇÑ´Ù. ´ëºÎºÐ Encode-DetectÀÇ C++ binding À¸·Î, frontend API°¡ Á¶±Ý¾¿ ´Ù¸£´Ù. * http://svn.oops.org/wsvn/OOPS.libchardet * http://code.google.com/p/libibase/source/browse/#svn/trunk/devel/libchardet * http://github.com/cnangel/libchardet/ == mod_chardet == * PHP API * libchardet / libicu / Python C API binding * Mozilla Universal Charset Detect Algorithm * ICU Conversion Charset Detection Alghrotim * Python Chardet bindng (with Python C API) * Requrires: libchardet that is distributed from oops.org