webpagespots.com
robots.txt

Robots Exclusion Standard data for webpagespots.com

Resource Scan

Scan Details

Site Domain webpagespots.com
Base Domain webpagespots.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't establish SSL connection.
Last Scan2025-12-06T00:48:38+00:00
Next Scan 2026-01-05T00:48:38+00:00

Last Successful Scan

Scanned2025-11-11T11:24:57+00:00
URL https://webpagespots.com/robots.txt
Domain IPs 104.21.95.21, 172.67.169.45, 2606:4700:3031::6815:5f15, 2606:4700:3034::ac43:a92d
Response IP 172.67.169.45
Found Yes
Hash c1107310ae43a2354d1f947af961da7f841e5ea62b7cab12f165ef570b8d1ec1
SimHash 66005d40e87f

Groups

*

Rule Path
Allow /wp-content/uploads/
Allow /wp-admin/admin-ajax.php
Allow /ads/preferences/
Allow /dtt/k
Allow /gpt/
Allow /pagead/show_ads.js
Allow /pagead/js/adsbygoogle.js
Allow /pagead/*/show_ads_impl.js
Allow /static/glade.js
Allow /static/glade/
Allow /tag/js/
Disallow /wp-admin/
Disallow /cgi-bin/
Disallow /?s=*
Disallow /comments/feed/
Disallow /trackback/
Disallow /index.php
Disallow /xmlrpc.php
Disallow /search?
Disallow /profile/
Disallow /?p=*
Disallow /feed/
Disallow /comments/
Disallow /readme.html
Disallow /refer/

Other Records

Field Value
crawl-delay 15

googlebot-image

Rule Path
Disallow

chatgpt-user

Rule Path
Disallow /

Other Records

Field Value
sitemap https://webpagespots.com/sitemap_index.xml