Fix for handling non-Latin characters#7
Conversation
This change introduces support for text containing non-Latin characters (Hindi, Urdu, Greek, for example). This is done by printing `html.unescape(final_string)` instead of `final_string`.
|
@deepseagirl could you merge this after review? |
|
@deepseagirl hi, just sending a ping on this. thanks! |
|
@deepseagirl Can we close this? |
|
hi, thanks. this is a good improvement :) new default will be to decode character references: flag to turn decoding off: the html.unescape python doc links to this list of named character references which seemed handy. |
|
i'll finalize this when i have a few more mins. should be soon now that it's this far along. thanks again |
|
@deepseagirl no worries, and I realize you were not able to access a computer earlier, so it is no problem. the new changes look great! thank you & tc =) |
|
@deepseagirl can we close? |
This change introduces support for search results containing non-Latin characters as part of the URL or description.
This is done by passing the
final_stringvariable to thehtml.unescape()function (instead of printing it directly) at the lastprintcall.