Weird URLs in Google Analytics

Joined
Jul 18, 2012
Messages
277
My client's WP site, which I originally developed about 4 years ago, is up for a new site. I'm reviewing the GA data to see which pages are of low value.

In Analytics, when I go to BEHAVIOR > SITE CONTENT > ALL PAGES, I see these URLs:
2019-07-31_13-59-12.png


When I click on any of those pages, they are all 404 because this is a WordPress site and I've never used .html extensions. I checked the Permalink Settings in WP, and they're the standard structure I always use: domain.com/%category%/%postname%/.

What could be causing this issue?
 

dannanelli

Member
Joined
May 24, 2014
Messages
38
My client's WP site, which I originally developed about 4 years ago, is up for a new site. I'm reviewing the GA data to see which pages are of low value.

In Analytics, when I go to BEHAVIOR > SITE CONTENT > ALL PAGES, I see these URLs:
2019-07-31_13-59-12.png


When I click on any of those pages, they are all 404 because this is a WordPress site and I've never used .html extensions. I checked the Permalink Settings in WP, and they're the standard structure I always use: domain.com/%category%/%postname%/.

What could be causing this issue?
I don’t have a lot of experience with this but something similar happened to me when I migrated a WP site to Liquid Web. For some reason after moving the site, all the pages loaded with /index.html appended to the end of them. Have you moved hosts or anything like that recently?
 

Eric Rohrback

Moderator
Joined
Oct 3, 2012
Messages
1,000
Was this a html site migration or did they use the domain in the past? It might be worth checking wayback machine to see if the domain was ever previously used (also check Whois "created date").

WP pages shouldn't have .html in the URLs because WP is php based. I would expect to see index.php not index.html.
 
Joined
Jul 18, 2012
Messages
277
I came across this issue on a site a week or two ago. Go into the Google Analytics Admin > View > View Settings and get rid of the Default Page. I'm betting it's set to index.html. Those numbers you're seeing aren't really for 404s. They are for the URL's minus the index.html suffix.
Wow- that's EXACTLY what I found! Thanks!
 
Top