Closed Bug 823110 Opened 12 years ago Closed 11 years ago

Sitemap is invalid, being ignored by Google

Categories

(developer.mozilla.org Graveyard :: General, defect)

defect
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: openjck, Unassigned)

References

Details

User Agent: Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:18.0) Gecko/20100101 Firefox/18.0
Build ID: 20121213224155

Steps to reproduce:

1. Visit http://www.freeformatter.com/xml-validator-xsd.html
2. For "XML Input", enter https://developer.mozilla.org/sitemaps/en-US/sitemap.xml
3. For "XSD Input", enter http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd


Actual results:

The following error is reported. According to Google Webmaster Tools, the Sitemap is being ignored as a result.

Cvc-elt.1.a: Cannot Find The Declaration Of Element 'urlset'.. Line '2', Column '62'.


Expected results:

The Sitemap should validate, and Google should be using it.
Commits pushed to master at https://github.com/mozilla/kuma

https://github.com/mozilla/kuma/commit/3c481bb57d12c922fb2a243943383b0b5a005a2e
fix bug 823110 - Create valid sitemap

https://github.com/mozilla/kuma/commit/4c158e23caae13d4100f139c5ee89e0921a59633
Merge pull request #778 from darkwing/sitemap-823110

fix bug 823110 - Create valid sitemap
Status: NEW → RESOLVED
Closed: 12 years ago
Resolution: --- → FIXED
Errors on https://developer.allizom.org/sitemaps/en-US/sitemap.xml

White Spaces Are Required Between PublicId And SystemId.. Line '1', Column '50'.
White Spaces Are Required Between PublicId And SystemId.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Did you check this within Google Webmaster Tools?
Still being ignored by Google. They do not provide a reason, but the STR from comment 0 result in the following error message.

Cvc-elt.1.a: Cannot Find The Declaration Of Element 'urlset'.. Line '2', Column '62'.
Raymond, I have set you up with an account on Google Webmaster Tools. In the future, can you please incorporate their results in your testing? You can navigate to Health > Blocked URLs to check whether Google is accepting the Sitemap.
(In reply to John Karahalis [:openjck] from comment #5)
> Raymond, I have set you up with an account on Google Webmaster Tools. In the
> future, can you please incorporate their results in your testing? You can
> navigate to Health > Blocked URLs to check whether Google is accepting the
> Sitemap.

John,  I will. Did you use my bugzilla account as the email or my mozilla.com one?
Frustrating.  I know it's not a basic code issue because copying my local sitemap into that validator works.  Will continue looking.
OK, quick summary:

When using the validator cited above (http://www.freeformatter.com/xml-validator-xsd.html), and copy/pasting the US sitemap, I got the same error message as previously reported:

"Cvc-elt.1.a: Cannot Find The Declaration Of Element 'urlset'.. Line '2', Column '62'."

My original bug fix removed the "s" in the <urlset>'s xmlns attribute, and that fixed the issue then, and fixes the issue now.  

This leads me to believe the sitemap cron either (a) isn't working or (b) hasn't run since my fix.  I will work with groovecoder (he set up the cron) to get this resolved.
Depends on: 799662
Blocks: 818951
Errors are being reported under the heading Health > Blocked URLs in GWT. Right now, it is reporting this:

> Line 3: Sitemap: sitemap.xml    Invalid sitemap URL detected; syntax not understood
Just resubmitted this sitemap.
Status: REOPENED → RESOLVED
Closed: 12 years ago11 years ago
Resolution: --- → FIXED
Moving to General component.
Component: SEO → General
Product: developer.mozilla.org → developer.mozilla.org Graveyard
You need to log in before you can comment on or make changes to this bug.