Hpricot and utf-8
I tried to use Hpricot to parse a page with special characters in a utf-8 encoding. The docs tell you to do this:
However, this won’t give you the output you want. The open method on Open-URI leaves the output in the default character set of the page. If you want to convert it to utf-8, you need to use the iconv library:
机器瞎学/数据掩埋/模式混淆/人工智障/深度遗忘/神经掉线/计算机幻觉/专注单身二十五年