getsitecontrol.com
robots.txt

Robots Exclusion Standard data for getsitecontrol.com

Resource Scan

Scan Details

Site Domain getsitecontrol.com
Base Domain getsitecontrol.com
Scan Status Ok
Last Scan2025-08-26T01:55:27+00:00
Next Scan 2025-09-25T01:55:27+00:00

Last Scan

Scanned2025-08-26T01:55:27+00:00
URL https://getsitecontrol.com/robots.txt
Domain IPs 104.26.6.129, 104.26.7.129, 172.67.73.142, 2606:4700:20::681a:681, 2606:4700:20::681a:781, 2606:4700:20::ac43:498e
Response IP 104.26.7.129
Found Yes
Hash 421bf6dc0cef0e6a8273a9f9415e2d9a2bc40bc8ca5e46e4a0f3d72f03d46372
SimHash 2814195089f1

Groups

*

Rule Path
Allow /
Disallow /unsubscribed/
Disallow /error/
Disallow /close
Disallow /t/
Disallow /te/
Disallow /i/
Disallow /w/

twitterbot

Rule Path
Allow /w/
Allow /i/

facebookexternalhit

Rule Path
Allow /w/
Allow /i/

Other Records

Field Value
sitemap https://getsitecontrol.com/sitemap.xml

Comments

  • Certain social media sites are whitelisted to allow crawlers to access page markup when links to /images are shared.