nl.newz.com
robots.txt

Robots Exclusion Standard data for nl.newz.com

Resource Scan

Scan Details

Site Domain nl.newz.com
Base Domain newz.com
Scan Status Ok
Last Scan 2026-03-08T23:07:06+00:00
Next Scan 2026-04-07T23:07:06+00:00

Last Scan

Scanned 2026-03-08T23:07:06+00:00
URL https://nl.newz.com/robots.txt
Domain IPs 104.21.68.67, 172.67.191.40, 2606:4700:3032::6815:4443, 2606:4700:3032::ac43:bf28
Response IP 172.67.191.40
Found Yes
Hash 6a3cd13972f6a4c7d18307834b81ea297d5bc14ad4becec15e7d8634ca8655f2
SimHash 401ccf60661b

Groups

baiduspider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

*

Rule Path
Allow /
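
The groups above can be checked with Python's standard urllib.robotparser module. This is a minimal sketch: the robots.txt body is rebuilt from the scanned groups, so its exact layout (ordering, blank lines, capitalization) is an assumption, and the page URL is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# User agents that the scan lists with "Disallow /".
BLOCKED_AGENTS = [
    "baiduspider", "ahrefsbot", "mj12bot", "blexbot",
    "dotbot", "semrushbot", "yandexbot",
]


def build_robots_txt(blocked_agents):
    """Rebuild a robots.txt body from the scanned groups (layout is assumed)."""
    groups = [f"User-agent: {agent}\nDisallow: /\n" for agent in blocked_agents]
    # Catch-all group: every other crawler is allowed everywhere.
    groups.append("User-agent: *\nAllow: /\n")
    return "\n".join(groups)


parser = RobotFileParser()
parser.parse(build_robots_txt(BLOCKED_AGENTS).splitlines())

# Hypothetical page URL, used only to exercise the rules.
page = "https://nl.newz.com/some-page"
for agent in ["AhrefsBot", "YandexBot", "Googlebot"]:
    print(agent, parser.can_fetch(agent, page))
```

The named crawlers match their own group and are refused, while any crawler without a dedicated group falls through to the * group and is allowed.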

Other Records

Field Value
sitemap https://nl.newz.com/sitemap_index.xml
sitemap https://nl.newz.com/sloty/sitemap.xml
sitemap https://nl.newz.com/news-sitemap.xml
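
The sitemap records can be read back the same way. The sketch below assumes the records appear in the file as standard "Sitemap:" lines and uses RobotFileParser.site_maps(), which is available in Python 3.8 and later.

```python
from urllib.robotparser import RobotFileParser

# Sitemap records from the scan, written as robots.txt lines (assumed form).
SITEMAP_LINES = [
    "Sitemap: https://nl.newz.com/sitemap_index.xml",
    "Sitemap: https://nl.newz.com/sloty/sitemap.xml",
    "Sitemap: https://nl.newz.com/news-sitemap.xml",
]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES)

# site_maps() returns None when no sitemap records were found.
sitemaps = sitemap_parser.site_maps() or []
for url in sitemaps:
    print(url)
```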