module CharDet
Ruby on Rails 2.3.18
Since v2.3.18 Last seen in v2.3.18Big5 frequency table by Taiwan’s Mandarin Promotion Council <www.edu.tw:81/mandr/>
128 –> 0.42261 256 –> 0.57851 512 –> 0.74851 1024 –> 0.89384 2048 –> 0.97583
Ideal Distribution Ratio = 0.74851/(1-0.74851) =2.98 Random Distribution Ration = 512/(5401-512)=0.105
Typical Distribution Ratio about 25% of Ideal one, still much higher than RDR
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Communicator client code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
Jeff Hodges
Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Communicator client code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s)
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Universal charset detector code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 2001 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python Shy Shalom - original C code
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is mozilla.org code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Universal charset detector code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 2001 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python Shy Shalom - original C code
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is mozilla.org code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is mozilla.org code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is mozilla.org code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
128 –> 0.79 256 –> 0.92 512 –> 0.986 1024 –> 0.99944 2048 –> 0.99999
Idea Distribution Ratio = 0.98653 / (1-0.98653) = 73.24 Random Distribution Ration = 512 / (2350-512) = 0.279.
Typical Distribution Ratio
BEGIN LICENSE BLOCK ######################## The Original Code is mozilla.org code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
128 –> 0.42261 256 –> 0.57851 512 –> 0.74851 1024 –> 0.89384 2048 –> 0.97583
Idea Distribution Ratio = 0.74851/(1-0.74851) =2.98 Random Distribution Ration = 512/(5401-512)=0.105
Typical Distribution Ratio about 25% of Ideal one, still much higher than RDR
BEGIN LICENSE BLOCK ######################## The Original Code is mozilla.org code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
512 –> 0.79 – 0.79 1024 –> 0.92 – 0.13 2048 –> 0.98 – 0.06 6768 –> 1.00 – 0.02
Ideal Distribution Ratio = 0.79135/(1-0.79135) = 3.79 Random Distribution Ration = 512 / (3755 - 512) = 0.157
Typical Distribution Ratio about 25% of Ideal one, still much higher that RDR
BEGIN LICENSE BLOCK ######################## The Original Code is mozilla.org code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
windows-1255 / ISO-8859-8 code points of interest
128 –> 0.77094 256 –> 0.85710 512 –> 0.92635 1024 –> 0.97130 2048 –> 0.99431
Ideal Distribution Ratio = 0.92635 / (1-0.92635) = 12.58 Random Distribution Ration = 512 / (2965+62+83+86-512) = 0.191
Typical Distribution Ratio, 25% of IDR
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Communicator client code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
Character Mapping Table: this table is modified base on win1251BulgarianCharToOrderMap, so only number <64 is sure valid
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Communicator client code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Communicator client code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Universal charset detector code.
The Initial Developer of the Original Code is
Simon Montagu
Portions created by the Initial Developer are Copyright © 2005 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python Shy Shalom - original C code Shoshannah Forbes - original C code (?)
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Communicator client code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Communicator client code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Universal charset detector code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 2001 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python Shy Shalom - original C code
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Universal charset detector code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 2001 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python Shy Shalom - original C code Proofpoint, Inc.
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Universal charset detector code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 2001 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python Shy Shalom - original C code Proofpoint, Inc.
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is mozilla.org code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Universal charset detector code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 2001 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python Shy Shalom - original C code
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Universal charset detector code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 2001 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python Shy Shalom - original C code
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is mozilla.org code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is Mozilla Universal charset detector code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 2001 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python Shy Shalom - original C code
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
BEGIN LICENSE BLOCK ######################## The Original Code is mozilla.org code.
The Initial Developer of the Original Code is Netscape Communications Corporation. Portions created by the Initial Developer are Copyright © 1998 the Initial Developer. All Rights Reserved.
Contributor(s):
Jeff Hodges - port to Ruby Mark Pilgrim - port to Python
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
END LICENSE BLOCK #########################
Namespace
Classes
- CharDet::Big5DistributionAnalysis
- CharDet::Big5Prober
- CharDet::CharDistributionAnalysis
- CharDet::CharSetGroupProber
- CharDet::CharSetProber
- CharDet::CodingStateMachine
- CharDet::EscCharSetProber
- CharDet::EUCJPContextAnalysis
- CharDet::EUCJPDistributionAnalysis
- CharDet::EUCJPProber
- CharDet::EUCKRDistributionAnalysis
- CharDet::EUCKRProber
- CharDet::EUCTWDistributionAnalysis
- CharDet::EUCTWProber
- CharDet::GB2312DistributionAnalysis
- CharDet::GB2312Prober
- CharDet::HebrewProber
- CharDet::JapaneseContextAnalysis
- CharDet::Latin1Prober
- CharDet::MBCSGroupProber
- CharDet::MultiByteCharSetProber
- CharDet::SBCSGroupProber
- CharDet::SingleByteCharSetProber
- CharDet::SJISContextAnalysis
- CharDet::SJISDistributionAnalysis
- CharDet::SJISProber
- CharDet::UniversalDetector
- CharDet::UTF8Prober
Constants
- CharDet::ACO
- CharDet::ACV
- CharDet::ASC
- CharDet::ASO
- CharDet::ASS
- CharDet::ASV
- CharDet::Big5CharLenTable
- CharDet::Big5CharToFreqOrder
- CharDet::BIG5_cls
- CharDet::Big5SMModel
- CharDet::BIG5_st
- CharDet::BIG5_TABLE_SIZE
- CharDet::BIG5_TYPICAL_DISTRIBUTION_RATIO
- CharDet::BulgarianLangModel
- CharDet::CLASS_NUM
- CharDet::DONT_KNOW
- CharDet::EDetecting
- CharDet::EError
- CharDet::EEscAscii
- CharDet::EFoundIt
- CharDet::EHighbyte
- CharDet::EItsMe
- CharDet::ENotMe
- CharDet::ENOUGH_DATA_THRESHOLD
- CharDet::ENOUGH_REL_THRESHOLD
- CharDet::EPureAscii
- CharDet::EStart
- CharDet::EUCJPCharLenTable
- CharDet::EUCJP_cls
- CharDet::EUCJPSMModel
- CharDet::EUCJP_st
- CharDet::EUCKRCharLenTable
- CharDet::EUCKRCharToFreqOrder
- CharDet::EUCKR_cls
- CharDet::EUCKRSMModel
- CharDet::EUCKR_st
- CharDet::EUCKR_TABLE_SIZE
- CharDet::EUCKR_TYPICAL_DISTRIBUTION_RATIO
- CharDet::EUCTWCharLenTable
- CharDet::EUCTWCharToFreqOrder
- CharDet::EUCTW_cls
- CharDet::EUCTWSMModel
- CharDet::EUCTW_st
- CharDet::EUCTW_TABLE_SIZE
- CharDet::EUCTW_TYPICAL_DISTRIBUTION_RATIO
- CharDet::FINAL_KAF
- CharDet::FINAL_MEM
- CharDet::FINAL_NUN
- CharDet::FINAL_PE
- CharDet::FINAL_TSADI
- CharDet::FREQ_CAT_NUM
- CharDet::GB2312CharLenTable
- CharDet::GB2312CharToFreqOrder
- CharDet::GB2312_cls
- CharDet::GB2312SMModel
- CharDet::GB2312_st
- CharDet::GB2312_TABLE_SIZE
- CharDet::GB2312_TYPICAL_DISTRIBUTION_RATIO
- CharDet::GreekLangModel
- CharDet::HebrewLangModel
- CharDet::HungarianLangModel
- CharDet::HZCharLenTable
- CharDet::HZ_cls
- CharDet::HZSMModel
- CharDet::HZ_st
- CharDet::IBM855_CharToOrderMap
- CharDet::Ibm855Model
- CharDet::IBM866_CharToOrderMap
- CharDet::Ibm866Model
- CharDet::ISO2022CNCharLenTable
- CharDet::ISO2022CN_cls
- CharDet::ISO2022CNSMModel
- CharDet::ISO2022CN_st
- CharDet::ISO2022JPCharLenTable
- CharDet::ISO2022JP_cls
- CharDet::ISO2022JPSMModel
- CharDet::ISO2022JP_st
- CharDet::ISO2022KRCharLenTable
- CharDet::ISO2022KR_cls
- CharDet::ISO2022KRSMModel
- CharDet::ISO2022KR_st
- CharDet::JISCharToFreqOrder
- CharDet::JIS_TABLE_SIZE
- CharDet::JIS_TYPICAL_DISTRIBUTION_RATIO
- CharDet::KOI8R_CharToOrderMap
- CharDet::Koi8rModel
- CharDet::Latin1_CharToClass
- CharDet::Latin1ClassModel
- CharDet::Latin2_HungarianCharToOrderMap
- CharDet::Latin2HungarianModel
- CharDet::Latin5_BulgarianCharToOrderMap
- CharDet::Latin5BulgarianModel
- CharDet::Latin5CyrillicModel
- CharDet::Latin7_CharToOrderMap
- CharDet::Latin7GreekModel
- CharDet::LOGICAL_HEBREW_NAME
- CharDet::MacCyrillicModel
- CharDet::MAX_REL_THRESHOLD
- CharDet::MIN_FINAL_CHAR_DISTANCE
- CharDet::MINIMUM_DATA_THRESHOLD
- CharDet::MINIMUM_THRESHOLD
- CharDet::MIN_MODEL_DISTANCE
- CharDet::NEGATIVE_SHORTCUT_THRESHOLD
- CharDet::NORMAL_KAF
- CharDet::NORMAL_MEM
- CharDet::NORMAL_NUN
- CharDet::NORMAL_PE
- CharDet::NORMAL_TSADI
- CharDet::NUMBER_OF_SEQ_CAT
- CharDet::NUM_OF_CATEGORY
- CharDet::ONE_CHAR_PROB
- CharDet::OTH
- CharDet::POSITIVE_CAT
- CharDet::POSITIVE_SHORTCUT_THRESHOLD
- CharDet::RussianLangModel
- CharDet::SAMPLE_SIZE
- CharDet::SB_ENOUGH_REL_THRESHOLD
- CharDet::SHORTCUT_THRESHOLD
- CharDet::SJISCharLenTable
- CharDet::SJIS_cls
- CharDet::SJISSMModel
- CharDet::SJIS_st
- CharDet::SURE_NO
- CharDet::SURE_YES
- CharDet::SYMBOL_CAT_ORDER
- CharDet::ThaiLangModel
- CharDet::TIS620CharToOrderMap
- CharDet::TIS620ThaiModel
- CharDet::UCS2BECharLenTable
- CharDet::UCS2BE_cls
- CharDet::UCS2BESMModel
- CharDet::UCS2BE_st
- CharDet::UCS2LECharLenTable
- CharDet::UCS2LE_cls
- CharDet::UCS2LESMModel
- CharDet::UCS2LE_st
- CharDet::UDF
- CharDet::UTF8CharLenTable
- CharDet::UTF8_cls
- CharDet::UTF8SMModel
- CharDet::UTF8_st
- CharDet::VERSION
- CharDet::VISUAL_HEBREW_NAME
- CharDet::Win1250HungarianCharToOrderMap
- CharDet::Win1250HungarianModel
- CharDet::Win1251BulgarianModel
- CharDet::Win1251CyrillicModel
- CharDet::Win1253_CharToOrderMap
- CharDet::Win1253GreekModel
- CharDet::Win1255_CharToOrderMap
- CharDet::Win1255HebrewModel
Methods (defined here)
- self. detect