gazeterize.com
robots.txt

Robots Exclusion Standard data for gazeterize.com

Resource Scan

Scan Details

Site Domain gazeterize.com
Base Domain gazeterize.com
Scan Status Ok
Last Scan2024-10-01T01:26:36+00:00
Next Scan 2024-10-08T01:26:36+00:00

Last Scan

Scanned2024-10-01T01:26:36+00:00
URL https://gazeterize.com/robots.txt
Redirect https://www.gazeterize.com/robots.txt
Redirect Domain www.gazeterize.com
Redirect Base gazeterize.com
Domain IPs 104.21.80.60, 172.67.174.184, 2606:4700:3035::6815:503c, 2606:4700:3037::ac43:aeb8
Redirect IPs 104.21.80.60, 172.67.174.184, 2606:4700:3035::6815:503c, 2606:4700:3037::ac43:aeb8
Response IP 172.67.174.184
Found Yes
Hash d1f149db4125b31d8934dbc60417a110845e04f51ea4fbf209949c37ef9c96aa
SimHash 21001623ec13

Groups

*

Rule Path
Disallow /arama
Disallow /public
Disallow /public/*
Disallow /service*
Disallow /share*
Disallow /tr/*
Disallow /*?ref=
Disallow /*?q=
Disallow /*?preview=
Disallow /*?utm_source=
Disallow /*?page=
Disallow /*?cursor=
Allow /

adsbot-google

Rule Path
Disallow /advert/*
Allow /

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

Other Records

Field Value
sitemap https://www.gazeterize.com/sitemap.xml