newswav.com
robots.txt
Robots Exclusion Standard data for newswav.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | newswav.com |
Base Domain | newswav.com |
Scan Status | Ok |
Last Scan | 2024-06-29T08:08:40+00:00 |
Next Scan | 2024-07-06T08:08:40+00:00 |
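The Last Scan and Next Scan timestamps above are exactly seven days apart. A minimal sketch of that arithmetic follows, assuming a fixed weekly rescan interval; the interval and the function name `next_scan` are assumptions for illustration and are not stated by the scanner.

```python
from datetime import datetime, timedelta

# Assumed interval: the two timestamps in the table differ by 7 days.
RESCAN_INTERVAL = timedelta(days=7)

def next_scan(last_scan_iso: str) -> str:
    """Return the ISO 8601 timestamp one rescan interval after last_scan_iso."""
    last_scan = datetime.fromisoformat(last_scan_iso)
    return (last_scan + RESCAN_INTERVAL).isoformat()

print(next_scan("2024-06-29T08:08:40+00:00"))
```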
Last Scan
Field | Value |
---|---|
Scanned | 2024-06-29T08:08:40+00:00 |
URL | https://newswav.com/robots.txt |
Domain IPs | 104.22.36.155, 104.22.37.155, 172.67.22.220, 2606:4700:10::6816:249b, 2606:4700:10::6816:259b, 2606:4700:10::ac43:16dc |
Response IP | 104.22.36.155 |
Found | Yes |
Hash | ebea7c019d5043b688c6566b3ff59cc8c5353b24e4b928c5f6957f6d359f2c0e |
SimHash | 4d1053e6c391 |
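The Hash value above is 64 hexadecimal characters, which matches the length of a SHA-256 digest. The scan does not name the algorithm, so the sketch below is an assumption: it shows how a content hash of a fetched robots.txt body could be computed for change detection. The function name `robots_digest` is hypothetical, and because the scanner's exact input bytes are unknown, the result may not reproduce the listed value.

```python
import hashlib

def robots_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

# Two scans with byte-identical bodies produce the same digest,
# so a changed digest signals a changed file.
previous = robots_digest(b"User-agent: *\nDisallow: /\n")
current = robots_digest(b"User-agent: *\nDisallow: /\n")
print(previous == current)
```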
Groups
*
Rule | Path |
---|---|
Disallow | / |
googlebot
googlebot
googlebot-video
googlebot-image
googlebot-news
googlebot-mobile
applebot
bingbot
bingpreview
baiduspider
baiduspider-mobile
baiduspider-video
baiduspider-image
naverbot
yeti
yandex
yandexbot
yandexmobilebot
yandexvideo
yandexwebmaster
yandexsitelinks
seznambot
slurp
duckduckbot
grapeshot
exabot
Rule | Path |
---|---|
Allow | / |
Disallow | /error |
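The two groups above can be read as: unnamed crawlers are blocked from the whole site, while the listed crawlers may fetch everything except paths under /error. A minimal sketch of that evaluation follows, assuming exact lowercase matching of the crawler name against a group and the longest-matching-path rule described in RFC 9309, with ties going to Allow. The names `GROUPS`, `select_rules`, and `is_allowed` are hypothetical, and only a few of the listed agents are included for brevity.

```python
# Groups as listed in the scan; only a subset of the named agents is shown.
GROUPS = [
    {"agents": {"*"}, "rules": [("disallow", "/")]},
    {
        "agents": {"googlebot", "bingbot", "applebot", "duckduckbot"},
        "rules": [("allow", "/"), ("disallow", "/error")],
    },
]

def select_rules(product_token):
    """Pick the rules of the group naming this crawler, else the '*' group."""
    token = product_token.lower()
    fallback = []
    for group in GROUPS:
        if token in group["agents"]:
            return group["rules"]
        if "*" in group["agents"]:
            fallback = group["rules"]
    return fallback

def is_allowed(product_token, path):
    """Apply the longest matching path prefix; Allow wins a tie."""
    best_len, best_kind = -1, "allow"  # no matching rule means allowed
    for kind, prefix in select_rules(product_token):
        if path.startswith(prefix):
            longer = len(prefix) > best_len
            tie_allow = len(prefix) == best_len and kind == "allow"
            if longer or tie_allow:
                best_len, best_kind = len(prefix), kind
    return best_kind == "allow"

print(is_allowed("googlebot", "/"))        # listed crawler, site root
print(is_allowed("googlebot", "/error"))   # listed crawler, disallowed path
print(is_allowed("somebot", "/"))          # unlisted crawler falls back to '*'
```

Parsers that apply rules in file order instead of by longest match would treat `/error` differently for the listed crawlers, since `Allow: /` appears first in that group.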
Other Records
Field | Value |
---|---|
sitemap | https://newswav.com/sitemap.xml |