• <legend id='BRgdc'><style id='BRgdc'><dir id='BRgdc'><q id='BRgdc'></q></dir></style></legend>

    1. <tfoot id='BRgdc'></tfoot>

        <bdo id='BRgdc'></bdo><ul id='BRgdc'></ul>

      <small id='BRgdc'></small><noframes id='BRgdc'>

      1. <i id='BRgdc'><tr id='BRgdc'><dt id='BRgdc'><q id='BRgdc'><span id='BRgdc'><b id='BRgdc'><form id='BRgdc'><ins id='BRgdc'></ins><ul id='BRgdc'></ul><sub id='BRgdc'></sub></form><legend id='BRgdc'></legend><bdo id='BRgdc'><pre id='BRgdc'><center id='BRgdc'></center></pre></bdo></b><th id='BRgdc'></th></span></q></dt></tr></i><div id='BRgdc'><tfoot id='BRgdc'></tfoot><dl id='BRgdc'><fieldset id='BRgdc'></fieldset></dl></div>
      2. 如何检测 Latin1 编码列中的 UTF-8 字符 - MySQL

        How to detect UTF-8 characters in a Latin1 encoded column - MySQL(如何检测 Latin1 编码列中的 UTF-8 字符 - MySQL)
      3. <tfoot id='ptWg8'></tfoot>
          <legend id='ptWg8'><style id='ptWg8'><dir id='ptWg8'><q id='ptWg8'></q></dir></style></legend>
            <i id='ptWg8'><tr id='ptWg8'><dt id='ptWg8'><q id='ptWg8'><span id='ptWg8'><b id='ptWg8'><form id='ptWg8'><ins id='ptWg8'></ins><ul id='ptWg8'></ul><sub id='ptWg8'></sub></form><legend id='ptWg8'></legend><bdo id='ptWg8'><pre id='ptWg8'><center id='ptWg8'></center></pre></bdo></b><th id='ptWg8'></th></span></q></dt></tr></i><div id='ptWg8'><tfoot id='ptWg8'></tfoot><dl id='ptWg8'><fieldset id='ptWg8'></fieldset></dl></div>
                <tbody id='ptWg8'></tbody>

                <small id='ptWg8'></small><noframes id='ptWg8'>

                  <bdo id='ptWg8'></bdo><ul id='ptWg8'></ul>
                • 本文介绍了如何检测 Latin1 编码列中的 UTF-8 字符 - MySQL的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着跟版网的小编来一起学习吧!

                  问题描述

                  我即将承担将数据库从 Latin1 转换为 UTF-8 的繁琐且充满陷阱的任务.

                  I am about to undertake the tedious and gotcha-laden task of converting a database from Latin1 to UTF-8.

                  此时我只想检查我的表中存储了哪些类型的数据,因为这将决定我应该使用什么方法来转换数据.

                  At this point I simply want to check what sort of data I have stored in my tables, as that will determine what approach I should use to convert the data.

                  具体来说,我想检查 Latin1 列中是否有 UTF-8 字符,最好的方法是什么?如果只有几行受到影响,那么我可以手动修复此问题.

                  Specifically, I want to check if I have UTF-8 characters in the Latin1 columns, what would be the best way to do this? If only a few rows are affected, then I can just fix this manually.

                  选项 1. 执行 MySQL 转储并使用 Perl 搜索 UTF-8 字符?

                  Option 1. Perform a MySQL dump and use Perl to search for UTF-8 characters?

                  选项 2. 使用 MySQL CHAR_LENGTH 查找具有多字节字符的行?例如SELECT name FROM clients WHERE LENGTH(name) != CHAR_LENGTH(name);够了吗?

                  Option 2. Use MySQL CHAR_LENGTH to find rows with multi-byte characters? e.g. SELECT name FROM clients WHERE LENGTH(name) != CHAR_LENGTH(name); Is this enough?

                  目前我已将 Mysql 客户端编码切换为 UTF-8.

                  At the moment I have switched my Mysql client encoding to UTF-8.

                  推荐答案

                  字符编码,就像时区一样,是一个不断出现问题的根源.

                  Character encoding, like time zones, is a constant source of problems.

                  您可以做的是查找任何高位 ASCII"字符,因为这些字符要么是 LATIN1 重音字符或符号,要么是 UTF-8 多字节字符的第一个.除非你作弊,否则很难区分.

                  What you can do is look for any "high-ASCII" characters as these are either LATIN1 accented characters or symbols, or the first of a UTF-8 multi-byte character. Telling the difference isn't going to be easy unless you cheat a bit.

                  要确定哪种编码是正确的,您只需SELECT 两个不同的版本并进行视觉比较.举个例子:

                  To figure out what encoding is correct, you just SELECT two different versions and compare visually. Here's an example:

                  SELECT CONVERT(CONVERT(name USING BINARY) USING latin1) AS latin1, 
                         CONVERT(CONVERT(name USING BINARY) USING utf8) AS utf8 
                  FROM users 
                  WHERE CONVERT(name USING BINARY) RLIKE CONCAT('[', UNHEX('80'), '-', UNHEX('FF'), ']')
                  

                  这变得异常复杂,因为 MySQL 正则表达式引擎似乎忽略了诸如 \x80 之类的东西,并且必须使用 UNHEX() 方法来代替.

                  This is made unusually complicated because the MySQL regexp engine seems to ignore things like \x80 and makes it necessary to use the UNHEX() method instead.

                  这会产生如下结果:

                  latin1                utf8
                  ----------------------------------------
                  Bjrn                Bjrn
                  

                  这篇关于如何检测 Latin1 编码列中的 UTF-8 字符 - MySQL的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持跟版网!

                  本站部分内容来源互联网,如果有图片或者内容侵犯了您的权益,请联系我们,我们会在确认后第一时间进行删除!

                  相关文档推荐

                  Bogus foreign key constraint fail(虚假外键约束失败)
                  how to get last insert id after insert query in codeigniter active record(如何在codeigniter活动记录中插入查询后获取最后一个插入ID)
                  Force InnoDB to recheck foreign keys on a table/tables?(强制 InnoDB 重新检查表/表上的外键?)
                  How to auto generate migrations with Sequelize CLI from Sequelize models?(如何使用 Sequelize CLI 从 Sequelize 模型自动生成迁移?)
                  Clear MySQL query cache without restarting server(无需重启服务器即可清除 MySQL 查询缓存)
                  ALTER TABLE to add a composite primary key(ALTER TABLE 添加复合主键)

                    <legend id='d519a'><style id='d519a'><dir id='d519a'><q id='d519a'></q></dir></style></legend>
                    • <bdo id='d519a'></bdo><ul id='d519a'></ul>
                      <i id='d519a'><tr id='d519a'><dt id='d519a'><q id='d519a'><span id='d519a'><b id='d519a'><form id='d519a'><ins id='d519a'></ins><ul id='d519a'></ul><sub id='d519a'></sub></form><legend id='d519a'></legend><bdo id='d519a'><pre id='d519a'><center id='d519a'></center></pre></bdo></b><th id='d519a'></th></span></q></dt></tr></i><div id='d519a'><tfoot id='d519a'></tfoot><dl id='d519a'><fieldset id='d519a'></fieldset></dl></div>
                      • <small id='d519a'></small><noframes id='d519a'>

                          <tbody id='d519a'></tbody>
                        • <tfoot id='d519a'></tfoot>