theceo.in
robots.txt

Robots Exclusion Standard data for theceo.in

Resource Scan

Scan Details

Site Domain theceo.in
Base Domain theceo.in
Scan Status Ok
Last Scan2024-11-16T13:36:36+00:00
Next Scan 2024-11-17T13:36:36+00:00

Last Scan

Scanned2024-11-16T13:36:36+00:00
URL http://theceo.in/robots.txt
Redirect https://www.theceo.in/robots.txt
Redirect Domain www.theceo.in
Redirect Base theceo.in
Domain IPs 23.20.179.164, 54.158.195.16
Redirect IPs 104.18.90.198, 104.18.91.198, 104.18.92.198, 104.18.93.198, 104.18.94.198, 2606:4700::6812:5ac6, 2606:4700::6812:5bc6, 2606:4700::6812:5cc6, 2606:4700::6812:5dc6, 2606:4700::6812:5ec6
Response IP 104.18.94.198
Found Yes
Hash 563cc574bf4aed0d762f5260f3e6f2114ea802c9126c99697a516f7a574f24d6
SimHash 703e9c102fb2

Groups

*

Rule Path
Disallow /template-options
Disallow /search?q=*
Disallow /author/
Disallow /topic/

semrush

Rule Path
Disallow /

ahref

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

claude

Rule Path
Disallow /

open ai

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.theceo.in/sitemap.xml