newsnit.com
robots.txt

Robots Exclusion Standard data for newsnit.com

Resource Scan

Scan Details

Site Domain newsnit.com
Base Domain newsnit.com
Scan Status Ok
Last Scan 2025-10-26T22:42:54+00:00
Next Scan 2025-11-25T22:42:54+00:00

Last Scan

Scanned 2025-10-26T22:42:54+00:00
URL https://newsnit.com/robots.txt
Domain IPs 104.21.88.116, 172.67.178.102, 2606:4700:3031::ac43:b266, 2606:4700:3033::6815:5874
Response IP 104.21.88.116
Found Yes
Hash 1498bde87d2ede4ca715e4f74e468f71bc72b7220ae1d496b15856035215ce94
SimHash 6901404547b2

Groups

*

Rule Path
Disallow
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /tag/
Allow /wp-admin/admin-ajax.php
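
To illustrate how a crawler might interpret the rules in this group, here is a minimal Python sketch using the longest-match precedence described in RFC 9309. The RULES list is transcribed from the table above. The function name is_allowed is hypothetical, wildcard patterns are not handled, and treating the empty Disallow as matching nothing is an assumption based on common crawler behavior. The original order of the lines in the file is not known from this report, and longest-match evaluation does not depend on it.

```python
# Rules as listed in the "*" group above: (directive, path prefix).
RULES = [
    ("disallow", ""),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/tag/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under simple prefix rules.

    The rule with the longest matching prefix wins; on a tie in length,
    "allow" wins. Wildcards (* and $) are not supported in this sketch.
    """
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for directive, prefix in rules:
        if not prefix:
            continue  # assumption: an empty Disallow matches nothing
        if path.startswith(prefix):
            longer = len(prefix) > best_len
            tie_allow = len(prefix) == best_len and directive == "allow"
            if longer or tie_allow:
                best_len = len(prefix)
                allowed = directive == "allow"
    return allowed


if __name__ == "__main__":
    for p in ("/", "/tag/news/", "/wp-admin/", "/wp-admin/admin-ajax.php"):
        print(p, is_allowed(p))
```

Under these assumptions, /wp-admin/admin-ajax.php stays fetchable even though /wp-admin/ is disallowed, because the Allow rule has the longer matching prefix. A first-match parser could give a different answer for the same rules, so results from other tools may not agree with this sketch.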

Other Records

Field Value
sitemap https://newsnit.com/sitemap_index.xml