thewhoblog.com
robots.txt

Robots Exclusion Standard data for thewhoblog.com

Resource Scan

Scan Details

Site Domain thewhoblog.com
Base Domain thewhoblog.com
Scan Status Ok
Last Scan2024-09-23T02:43:02+00:00
Next Scan 2024-10-23T02:43:02+00:00

Last Scan

Scanned2024-09-23T02:43:02+00:00
URL https://thewhoblog.com/robots.txt
Domain IPs 104.219.248.120
Response IP 104.219.248.120
Found Yes
Hash 994dcd9ca9e95921ca20082a0d9990f195c3d50964006b612a3dcb4fa88e1867
SimHash 4804062079b7

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-content/uploads/wpo-plugins-tables-list.json
Allow /wp-admin/admin-ajax.php

rogerbot/1.2

Rule Path
Disallow

dotbot

Rule Path
Disallow

dotbot/1.1

Rule Path
Disallow

ahrefsbot

Rule Path
Disallow

mj12bot

Rule Path
Disallow

semrushbot

Rule Path
Disallow

ninjabot

Rule Path
Disallow

facebot

Rule Path
Disallow

twitterbot

Rule Path
Disallow

linkedinbot

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.thewhoblog.com/sitemap_index.xml
sitemap https://www.thewhoblog.com/post-sitemap.xml
sitemap https://www.thewhoblog.com/page-sitemap.xml
sitemap https://www.thewhoblog.com/category-sitemap.xml
sitemap https://www.thewhoblog.com/author-sitemap.xml

Comments

  • Adding Multiple Sitemaps