Hi,
Quote: Originally posted by the.rampage.rado
http://www.rampagehockey.eu/neo/grap...%B0-p-182.html here's the link for the problematic item with Bulgarian SEO url in browser but with junk in sitemap.

This is a test installation of 1.5.1 to test out things like cyrillic support and modules I'll be using.
The address you expect to see is not a valid URL.
For example, this URL
Code:
/neo/graphics-cards-c-1_4/Свещ-Зимна-приказка-p-182.html
must be written as
Code:
/neo/graphics-cards-c-1_4/%D0%A1%D0%B2%D0%B5%D1%89-%D0%97%D0%B8%D0%BC%D0%BD%D0%B0-%D0%BF%D1%80%D0%B8%D0%BA%D0%B0%D0%B7%D0%BA%D0%B0-p-182.html
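The conversion between the two forms is plain UTF-8 percent-encoding. A minimal sketch in Python (used here only for illustration, it is not Zen Cart code), assuming the path is UTF-8 text:

```python
from urllib.parse import quote

# The path as it is typed, with raw Cyrillic characters.
raw_path = "/neo/graphics-cards-c-1_4/Свещ-Зимна-приказка-p-182.html"

# quote() encodes each non-ASCII character as UTF-8 bytes and writes
# every byte as %XX. "/" is kept by default; "-", "_" and "." are
# unreserved characters and are left alone.
encoded_path = quote(raw_path)

print(encoded_path)
```

The result should match the percent-encoded form above character for character.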
See RFC 3986, Uniform Resource Identifier (URI): Generic Syntax, section 2, "Characters":
A URI is composed from a limited set of characters consisting of
digits, letters, and a few graphic symbols. A reserved subset of
those characters may be used to delimit syntax components within a
URI while the remaining characters, including both the unreserved set
and those reserved characters not acting as delimiters, define each
component's identifying data.
A good example is the URLs on Wikipedia, such as http://bg.wikipedia.org/wiki/Кирилица
Note that the actual URL is
Code:
bg.wikipedia.org/wiki/%D0%9A%D0%B8%D1%80%D0%B8%D0%BB%D0%B8%D1%86%D0%B0
and not
Code:
bg.wikipedia.org/wiki/Кирилица
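Both forms name the same page; browsers typically show the decoded text in the address bar while sending the encoded form. A small sketch of that round trip, again in Python purely as an illustration:

```python
from urllib.parse import quote, unquote

encoded = "bg.wikipedia.org/wiki/%D0%9A%D0%B8%D1%80%D0%B8%D0%BB%D0%B8%D1%86%D0%B0"

# Decoding gives the human-readable form shown in the address bar.
readable = unquote(encoded)
print(readable)

# Encoding the readable form again gives back the valid URL.
print(quote(readable) == encoded)
```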
Of course, I could add extra processing to the URLs to make them valid according to RFC 3986.
But:
1. The standard function zen_href_link() does not need any additional processing. It already returns valid URLs.
2. It would not save you from the trouble of having your site indexed with invalid URLs.
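For reference, the kind of extra processing meant here could look roughly like the sketch below. It is Python, not Zen Cart code, and `ensure_ascii_url` is a hypothetical name used only for this example. It assumes the only problem is raw non-ASCII characters in the path; it does not handle spaces, other invalid ASCII characters, or non-ASCII host names.

```python
from urllib.parse import quote

def ensure_ascii_url(url: str) -> str:
    """Percent-encode only the non-ASCII characters of a URL (hypothetical helper)."""
    # ASCII characters, including existing %XX escapes, pass through
    # unchanged, so an already valid URL is returned as-is.
    return "".join(ch if ord(ch) < 128 else quote(ch) for ch in url)

raw = "/neo/graphics-cards-c-1_4/Свещ-Зимна-приказка-p-182.html"
fixed = ensure_ascii_url(raw)

print(fixed)
# Running it a second time changes nothing.
print(ensure_ascii_url(fixed) == fixed)
```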