gloucesterboy.com
robots.txt

Robots Exclusion Standard data for gloucesterboy.com

Resource Scan

Scan Details

Site Domain gloucesterboy.com
Base Domain gloucesterboy.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't establish SSL connection.
Last Scan 2025-07-20T18:29:07+00:00
Next Scan 2025-10-18T18:29:07+00:00
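
The Last Scan and Next Scan timestamps above are ISO 8601 values with a UTC offset, so the gap between them can be checked directly. The sketch below only does that arithmetic; the variable names are illustrative, and it does not claim to describe how the scanner actually schedules rescans.

```python
from datetime import datetime

# Timestamps copied from the Scan Details rows above.
last_scan = datetime.fromisoformat("2025-07-20T18:29:07+00:00")
next_scan = datetime.fromisoformat("2025-10-18T18:29:07+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
rescan_gap = next_scan - last_scan
print(rescan_gap.days)
```

For these two values the gap comes out to 90 days.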

Last Successful Scan

Scanned 2023-03-10T20:42:37+00:00
URL https://gloucesterboy.com/robots.txt
Domain IPs 130.211.40.170
Response IP 130.211.40.170
Found Yes
Hash 1cce9d1828c613b835c09a2f65a83523f2cfbe90f14b5eea69c61ac9d0b29053
SimHash 69384c16c012

Groups

spinn3r

Rule Path
Disallow /

*

Rule Path
Disallow /api/
Disallow /thanks
Disallow */listing/*/similar
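
As a rough illustration of how a crawler might evaluate the two groups above, the sketch below rebuilds a robots.txt body from the listed rules and queries it with Python's standard `urllib.robotparser`. The file text is a reconstruction (the original layout and comment lines are not reproduced), and `examplebot`, `BASE`, and `wildcard_match` are hypothetical names introduced for the example. The standard-library parser matches rules by plain path prefix, so the wildcard rule is checked with a small separate helper that treats `*` as "any run of characters".

```python
import re
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed above; not the original file text.
ROBOTS_TXT = """\
User-agent: spinn3r
Disallow: /

User-agent: *
Disallow: /api/
Disallow: /thanks
Disallow: */listing/*/similar
"""

BASE = "https://gloucesterboy.com"  # assumed base URL for the checks

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# spinn3r is blocked from the whole site.
print(parser.can_fetch("spinn3r", BASE + "/"))
# Any other agent (hypothetical "examplebot") falls under the "*" group.
print(parser.can_fetch("examplebot", BASE + "/api/items"))
print(parser.can_fetch("examplebot", BASE + "/thanks"))
print(parser.can_fetch("examplebot", BASE + "/about"))


def wildcard_match(pattern: str, path: str) -> bool:
    """Match a robots.txt path pattern where '*' matches any characters
    and a trailing '$' anchors the end; otherwise it is a prefix match."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    return re.match(regex, path) is not None


print(wildcard_match("*/listing/*/similar", "/listing/123/similar"))
print(wildcard_match("*/listing/*/similar", "/listing/123"))
```

Because matching is by prefix, `Disallow /thanks` also covers longer paths that begin with `/thanks`.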

Other Records

Field Value
sitemap https://www.gloucesterboy.com/sitemaps.xml?sitemap=listings&offset=0
sitemap https://www.gloucesterboy.com/sitemaps.xml?sitemap=blogs&offset=0
sitemap https://www.gloucesterboy.com/sitemaps.xml?sitemap=pages&offset=0
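
The three sitemap records above share one endpoint and differ only in the `sitemap` query parameter. A minimal sketch of reading them back, assuming Python 3.8 or later for `RobotFileParser.site_maps()`; the `Sitemap:` lines are rebuilt from the records above, and `describe_sitemap` is a hypothetical helper.

```python
from urllib.parse import parse_qs, urlsplit
from urllib.robotparser import RobotFileParser

# Sitemap lines rebuilt from the Other Records table above.
SITEMAP_LINES = [
    "Sitemap: https://www.gloucesterboy.com/sitemaps.xml?sitemap=listings&offset=0",
    "Sitemap: https://www.gloucesterboy.com/sitemaps.xml?sitemap=blogs&offset=0",
    "Sitemap: https://www.gloucesterboy.com/sitemaps.xml?sitemap=pages&offset=0",
]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES)
sitemap_urls = sitemap_parser.site_maps() or []  # site_maps() returns None if empty


def describe_sitemap(url: str) -> tuple:
    """Return (hostname, sitemap section, offset) for one sitemap URL."""
    parts = urlsplit(url)
    query = parse_qs(parts.query)
    return parts.hostname, query["sitemap"][0], int(query["offset"][0])


sitemap_sections = [describe_sitemap(url) for url in sitemap_urls]
for section in sitemap_sections:
    print(section)
```

Note that the sitemap URLs use the `www.gloucesterboy.com` host, while the scanned robots.txt URL uses `gloucesterboy.com`.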

Comments

  • \
  • -----
  • | . . |
  • -----
  • \--|-|--/
  • | |
  • |-------|