
https://unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1252.TXT



0x9D is undefined in CP1252; 0x9C is defined (U+0153, LATIN SMALL LIGATURE OE)
New swap of
'\xe2809c'  # 0xE2809C -> LEFT DOUBLE QUOTATION MARK (U+201C)
'\xe2809d'  # 0xE2809D -> RIGHT DOUBLE QUOTATION MARK (U+201D)

just before the    # 0x09 → UNDEFINED


e2809d: https://www.fileformat.info/info/unicode/char/201d/index.htm
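The relationship between the two byte sequences and CP1252 can be checked directly. The following is a minimal sketch, assuming the bytes come from UTF-8 encoded text; the variable names are illustrative only.

```python
import unicodedata

# The three-byte sequences noted above, written as hex.
left = bytes.fromhex("e2809c")
right = bytes.fromhex("e2809d")

# Decoded as UTF-8, each sequence is a single curly quotation mark.
names = [unicodedata.name(seq.decode("utf-8")) for seq in (left, right)]

# Decoded as CP1252, the left quote turns into three unrelated characters,
# because 0xE2, 0x80 and 0x9C are all defined in CP1252.
left_as_cp1252 = left.decode("cp1252")

# The right quote cannot be decoded at all, because 0x9D is undefined.
try:
    right.decode("cp1252")
    right_fails = False
except UnicodeDecodeError:
    right_fails = True

print(names, repr(left_as_cp1252), right_fails)
```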

add 0x9c and 0x9d after 0x9F

0x81

0x8d

0x8f


0x90

0x9d
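The list of undefined bytes can be reproduced without reading the mapping file by hand. This is a small sketch that tries every single-byte value against Python's built-in cp1252 codec and collects the ones that fail.

```python
# Try to decode each possible byte value with the cp1252 codec and
# record the values that raise UnicodeDecodeError.
undefined = []
for value in range(256):
    try:
        bytes([value]).decode("cp1252")
    except UnicodeDecodeError:
        undefined.append(value)

undefined_hex = [f"0x{value:02x}" for value in undefined]
print(undefined_hex)
```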

Modify cp1252.py in:

C:\Python37\Lib\encodings

'\x81'  # 0x20 -> SPACE
'\x8d'  # 0x20 -> SPACE
'\x8f'  # 0x20 -> SPACE
'\x90'  # 0x20 -> SPACE
'\x9d'  # 0x20 -> SPACE
and hash 
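A similar effect might be achievable without editing the installed cp1252.py, by registering a custom error handler. The sketch below assumes the goal is to turn each undefined byte into a space, as the comments above suggest; the handler name undefined_to_space and the sample bytes are hypothetical.

```python
import codecs


def undefined_to_space(exc):
    """Replace each undecodable byte with a space and resume decoding."""
    if isinstance(exc, UnicodeDecodeError):
        return (" " * (exc.end - exc.start), exc.end)
    raise exc


# Register the handler under a hypothetical name so it can be passed
# as the errors= argument of bytes.decode().
codecs.register_error("undefined_to_space", undefined_to_space)

# Sample input: 0xE9 is defined in CP1252, 0x8F is not.
raw = b"caf\xe9 \x8f end"
text = raw.decode("cp1252", errors="undefined_to_space")
print(repr(text))
```

The handler only applies to calls that request it explicitly, so other code that relies on the standard cp1252 behaviour is left unchanged.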


Error:
C:\Python37\python.exe C:/Users/jkriker/Documents/GitHub/html_parser_HelpNDoc/call_other_script_with_list_files.py
Traceback (most recent call last):
  File "html_parser_HelNDoc.py", line 265, in <module>
    main(sys.argv[1:])
  File "html_parser_HelNDoc.py", line 244, in main
    soup = getfile(filename)
  File "html_parser_HelNDoc.py", line 49, in getfile
    soup = BeautifulSoup(response.read(), 'html.parser')
  File "C:\Python37\lib\encodings\cp1252.py", line 23, in decode
    return codecs.charmap_decode(input,self.errors,decoding_table)[0]
UnicodeDecodeError: 'charmap' codec can't decode byte 0x8f in position 3144: character maps to <undefined>
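The traceback shows the input being decoded with the platform default codec (cp1252). One possible way to avoid the error is to read the file as bytes and decode it explicitly. This is only a sketch under the assumption that the HTML files are mostly UTF-8; decode_html is a hypothetical helper, not part of the original script.

```python
def decode_html(raw: bytes) -> str:
    """Decode as UTF-8 first, then fall back to CP1252 with replacement."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # Undefined CP1252 bytes become U+FFFD instead of raising.
        return raw.decode("cp1252", errors="replace")


# UTF-8 curly quotes decode cleanly on the first attempt.
quoted = decode_html(b"\xe2\x80\x9cquoted\xe2\x80\x9d")

# Invalid UTF-8 falls back to CP1252; the undefined byte 0x8F is replaced.
fallback = decode_html(b"caf\xe9 \x8f")

print(repr(quoted), repr(fallback))
```

The resulting string could then be passed to BeautifulSoup in place of the raw result of response.read().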


...