123ArticleOnline Logo
Welcome to 123ArticleOnline.com!
ALL >> Education >> View Article

How To Check Which Urls Have Been Indexed Without Upsetting Google

Profile Picture
By Author: Aleisha
Total Articles: 1
Comment this article
Facebook ShareTwitter ShareGoogle+ ShareTwitter Share

How can I learn which pages have not been indexed by Google, and do so in a way that has not infringed Google's rules? Google does not indicate whether a page has been indexed in the Google Search Console, does not let us erase the search results to get the answer and does not want to indirectly get the response from an undocumented API.

Use Google Query Explorer and download it as tsv
You can then download your XML sitemap locally and open it in Excel. Then, drag it into the Excel window, and you will get the "Import XML" dialog box. If it asks you to "Open the file without applying a style sheet", select OK:

Importing an xml sitemap into excel
Next, choose to open the "As an XML table" file:

Import xml as table in excel
You can delete the foreign columns by retaining only the column "ns1: loc" (or "loc"):

Remove xml sitemap irrelevant columns after importing to Excel
Then you just need to do a VLOOKUP or some other form of Excel matching and find the URLs in the sitemap that are not present in the analysis data.

I thought it was a simple but smart solution, and although ...
... a good starting point, I was afraid it did not show exactly which pages were indexed by Google. It is not uncommon for pages to receive little or no traffic, even if they are indexed. It may be an indication that the page is not indexed, but it can also show that the page has a marking problem, has become useless, needs some optimization to improve its visibility or simply is not Present in the XML Sitemap.

The log file solution:
Server log files are an excellent source of data on your website that is often inaccessible by other means. One of the many information that can be derived from these log files is whether or not a certain bot has accessed your website. In our case, the bot to which we are concerned is Googlebot.

Scanning our server log files allows us to check if Googlebot has already visited a certain page on our website. If Googlebot has never visited a certain page, it may not have been indexed by Google. I personally tend to use KNIME for this purpose, with the integrated Web Log Reader node, but do not hesitate to use your preferred solution.

Screaming Frog Log File Analyzer provides an easier solution for log file analysis.
Screaming Frog Log File Analyzer provides an easier solution for log file analysis.
Like Google Analytics, scanning log files is not foolproof. Googlebot may visit a page but do not include it in its index.

Combining your data
To restrict our list of pages that can not be indexed by Google as much as possible, I recommend combining the captured data using the Google Analytics technique with the log file analysis methods above.

Conclusion
Given that Google does not provide a tool or data on whether a web page has been indexed or not, and we are not allowed to use an automated solution like the one I wrote previously, we must Try to narrow down our list of URLs that may not be indexed.

We can do this by reviewing our Google Analytics data for pages on our website that do not receive organic Google traffic and view server log files. From there, we can manually check our short URL list punctually.

It's not an ideal solution, but it does the job. I hope that in the future, Google will provide a better way to evaluate which pages have been indexed and which ones have not.

Total Views: 388Word Count: 590See All articles From Author

Add Comment

Education Articles

1. Why Chennai Graduates Are Moving Toward Business Analytics
Author: sudeshna

2. Why Google Maps Is The Easiest Way To Discover The Best Cbse Schools In Howrah
Author: Siya

3. Sap Abap Rap Course Online With Projects At Visualpath
Author: gollakalyan

4. Dynamics 365 Training | Microsoft Dynamics 365 Crm Training
Author: naveen

5. Best Salesforce Data Cloud Training Course | Online Training
Author: Vamsi Ulavapati

6. How To Find The Best Ib Maths Tutor In Uae (dubai, Abu Dhabi & Beyond)
Author: Kapil

7. Complete Guide To Cpp Dumps And Exam Pass Support For Certification Success
Author: certpasscenter

8. Importance Of Excel In Data Analytics
Author: Kriti M

9. Is A Job-ready Azure Internship Better Than A Traditional It Course? Here's What The Numbers Say
Author: Evision Technoserve

10. Mba In Meerut That Actually Prepares You For The Data And Ai Era
Author: content editor for samphire it solution

11. Mba Roi Calculator: How To Measure Returns Before Admission
Author: UniversityGuru

12. Cgeit Dumps And Exam Pass Support: A Smart Way To Prepare For Certification Success
Author: certfastpass

13. Osai+ Certification: Your Complete Roadmap To Becoming A Modern Cybersecurity Specialist
Author: NYTCC

14. Osth Certification: Your Complete Roadmap To Building A Powerful Cybersecurity Career
Author: Passyourcert

15. Pass Your Ecir Certification Today
Author: Passyourcert

Login To Account
Login Email:
Password:
Forgot Password?
New User?
Sign Up Newsletter
Email Address: