ciddigazete.com
robots.txt

Robots Exclusion Standard data for ciddigazete.com

Resource Scan

Scan Details

Site Domain ciddigazete.com
Base Domain ciddigazete.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-08-26T12:27:03+00:00
Next Scan 2024-10-25T12:27:03+00:00

Last Successful Scan

Scanned2024-06-28T12:20:38+00:00
URL https://ciddigazete.com/robots.txt
Redirect https://www.ciddigazete.com/robots.txt
Redirect Domain www.ciddigazete.com
Redirect Base ciddigazete.com
Domain IPs 104.21.77.28, 172.67.203.226, 2606:4700:3034::ac43:cbe2, 2606:4700:3037::6815:4d1c
Redirect IPs 104.21.77.28, 172.67.203.226, 2606:4700:3034::ac43:cbe2, 2606:4700:3037::6815:4d1c
Response IP 104.21.77.28
Found Yes
Hash 7a177f81e7f06f17fbfc64a832a79fb178e8219a27064ba00534ec0c6d61cbff
SimHash 6d381e36e612

Groups

*

Rule Path
Disallow /public
Disallow /public/*
Disallow /public/index.php
Disallow /public/index.php/*
Disallow /service*
Disallow /share*
Disallow /tr/*
Disallow /*?ref=
Disallow /*?q=
Disallow /*?preview=
Disallow /*?utm_source=
Disallow /*?ref=
Disallow /*?page=
Allow /

adsbot-google

Rule Path
Disallow /advert/*
Allow /

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

Other Records

Field Value
sitemap https://www.ciddigazete.com/sitemap.xml