chupakabra Posted February 6, 2006 Report Posted February 6, 2006 Does uTorrent support RSS feeds with UTF-8 encoding?
chupakabra Posted February 6, 2006 Author Report Posted February 6, 2006 Strange. I cannot make it to display feeds with cyrillic letters in it. uTorrent shows some garbage instead of titles. I verified the same feed with Firefox and Opera and it is displayed correctly there. Can you please suggest anything?
Firon Posted February 6, 2006 Report Posted February 6, 2006 Make sure the feed is actually UTF-8, it's quite common for it to be some other encoding.
Inf Posted February 6, 2006 Report Posted February 6, 2006 if its win-1251 it should work as well, but you have to swtich the system locale to russiananyway, what is the feed ?
Firon Posted February 6, 2006 Report Posted February 6, 2006 Inf: I don't think that will work for µTorrent, because the program is internally UTF-16 and is (supposed to be) unaffected by the non-unicode setting
Inf Posted February 6, 2006 Report Posted February 6, 2006 System locale still used as a default for MultiByteToWideChar calls when mapping from CP_ACP (default 'ascii' codepage), and probably also used by the rest of similar API calls. So its still worth a try.
chupakabra Posted February 6, 2006 Author Report Posted February 6, 2006 Make sure the feed is actually UTF-8, it's quite common for it to be some other encoding.I am sure that it uses UTF-8 encoding. Firefox shows encoding as "Unicode (UTF-8)". Also it has UTF-8 specifed as document encoding:<?xml version="1.0" encoding="UTF-8"?>if its win-1251 it should work as well, but you have to swtich the system locale to russianYes, feeds with win-1251 encoding work properly even without switching locale. But this one uses UTF-8 and it does not work.anyway, what is the feed ?It is from a private tracker. I think you will be unable to access it.
Firon Posted February 6, 2006 Report Posted February 6, 2006 ah, I see. Perhaps it doesn't support it properly then.
Inf Posted February 6, 2006 Report Posted February 6, 2006 Indeed, it seems to be some kind of bug, i just found a UTF8 feed (non-torrent related, neither russian, but uses cyrillic charaters + UTF8) and tried to put it in utorrent. The releases list looks like a set of garbaged characters.This is the feed, open in a browser and switch to UTF8 to see how the titles should look like:http://alliance.maidanua.org/node/feedFiron: can you move it to "Found Bugs" ?
amonrei Posted February 7, 2006 Report Posted February 7, 2006 Same with tokyotosho. Titles with Japanese word in it won't display correctly.
chupakabra Posted February 8, 2006 Author Report Posted February 8, 2006 I tested build 420 and it works properly with UTF-8 now. Thank you. But there are some problems still:1. Feed using Windows-1251 encoding is now broken (it was working before).2. Looks like uTorrent does not decode some special encoded characters. For example, here is a title from XML source:<title>Fragma - Toca& #039;s Miracle [2005] (Music Video)</title>uTorrent displays it as you see it (with & #039;), but Firefox shows more correct version:Fragma - Toca's Miracle [2005] (Music Video)I added space between & and #039; so that this sequence is not interpreted by browser.
chupakabra Posted February 9, 2006 Author Report Posted February 9, 2006 Tested build 421: problem 1 is fixed, problem 2 is still present.
chupakabra Posted February 10, 2006 Author Report Posted February 10, 2006 Firon, I think you don't understand me. I don't have problems with feed encoding anymore, both UTF-8 and Win-1251 are displayed correctly with build 421. The problem is that uTorrent does not decode characters which are encoded like this one: & #039; I think 039 is just a character code, in this case character is apostrophe ('). Please see example in one of my previos posts.
Firon Posted February 10, 2006 Report Posted February 10, 2006 indeed, I totally misread your post. I'll mention this to ludde
chupakabra Posted February 19, 2006 Author Report Posted February 19, 2006 Tested build 426 and it looks like problem with xx; sequences is fixed. But there is still one more bug: uTorrent does not decode sequences like " (which should be " ), & (&)
Recommended Posts
Archived
This topic is now archived and is closed to further replies.