Pythonスクリプトで『UnicodeDecodeError: ’ascii’ codec can’t decode byte 0xc3』というエラーがでたときの対処

release: 2017-08-01 update: 2020-09-21

PythonのスクリプトでDBから日本語を含む文字列を扱った際、『UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3』といったエラーが出た。これは、Pythonでは明示的に指定しないとデフォルトエンコーディングがasciiになっているため、UTF-8がそのまま利用できないのが理由のようだ。

blacknon@BS-PUB-UBUNTU-01:~$ python
Python 2.7.12 (default, Nov 19 2016, 06:48:10)
[GCC 5.4.0 20160609] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import sys
>>> sys.getdefaultencoding()
'ascii'

以下の記述をスクリプトに追記することで、デフォルトのエンコーディングがUTF-8方式になるので、それで対処できる。

import sys
reload(sys)
sys.setdefaultencoding('utf-8')

俺的備忘録〜なんかいろいろ〜

Blog

Documents

Tools

Pythonスクリプトで『UnicodeDecodeError: ’ascii’ codec can’t decode byte 0xc3』というエラーがでたときの対処

俺的備忘録

〜なんかいろいろ〜

最近の投稿

gitで直近のmergeで発生した差分だけをgit diffで取得する

git diffの結果をフルパスで表示させる

Python 3.9でasync使用時に『can't register atexit after shutdown』というエラーが出るようになった

xargsで各引数ごとの出力の先頭を色分けして表示する

コンソール上でひらがな、カタカナの文字を一括指定して置換する

Twitter

Sponsored Link

Other Page

Sponsored Link

最近の投稿

Twitter

Sponsored Link