beforeitsnews.com
robots.txt
Robots Exclusion Standard data for beforeitsnews.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | beforeitsnews.com |
Base Domain | beforeitsnews.com |
Scan Status | Ok |
Last Scan | 2024-11-13T21:13:34+00:00 |
Next Scan | 2024-11-20T21:13:34+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-13T21:13:34+00:00 |
URL | https://beforeitsnews.com/robots.txt |
Domain IPs | 104.21.94.231, 172.67.141.76, 2606:4700:3034::ac43:8d4c, 2606:4700:3036::6815:5ee7 |
Response IP | 172.67.141.76 |
Found | Yes |
Hash | e3457a08517a290fe1f9079fe1c0fbb9eeb11fe7eade829e23d4b33ba4490336 |
SimHash | e017d836e693 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | / |
Disallow | /dashboard/* |
Disallow | /special/* |
Disallow | /mediadrop/* |
Disallow | /captcha/* |
Disallow | /ckeditor* |
Disallow | /ckfinder/* |
Disallow | /core/* |
Disallow | /cron-job/* |
Disallow | /dAjax/* |
Disallow | /social-connect/* |
Disallow | /scripts/* |
Disallow | /scan/* |
Disallow | /static/adcode/* |
Disallow | /static/tracking/* |
Disallow | /story/* |
Disallow | /logs/* |
Disallow | /login/* |
Disallow | /v3/login/* |
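
Because the group opens with `Allow | /` and then lists narrower `Disallow` paths, the outcome for a given URL depends on how a crawler resolves overlapping rules. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, `Allow` wins a tie, `*` matches any run of characters, and a trailing `$` anchors the end of the path). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers introduced for illustration, and individual crawlers may resolve these rules differently.

```python
import re

# Rules transcribed from the "*" group in the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/dashboard/*"),
    ("disallow", "/special/*"),
    ("disallow", "/mediadrop/*"),
    ("disallow", "/captcha/*"),
    ("disallow", "/ckeditor*"),
    ("disallow", "/ckfinder/*"),
    ("disallow", "/core/*"),
    ("disallow", "/cron-job/*"),
    ("disallow", "/dAjax/*"),
    ("disallow", "/social-connect/*"),
    ("disallow", "/scripts/*"),
    ("disallow", "/scan/*"),
    ("disallow", "/static/adcode/*"),
    ("disallow", "/static/tracking/*"),
    ("disallow", "/story/*"),
    ("disallow", "/logs/*"),
    ("disallow", "/login/*"),
    ("disallow", "/v3/login/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    # A trailing "$" anchors the pattern to the end of the path.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # a path matched by no rule is allowed by default
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt matching does.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            # Prefer the longer pattern; on equal length prefer Allow.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


for sample in ("/", "/story/example", "/dashboard", "/ckeditor/ckeditor.js"):
    print(sample, is_allowed(sample))
```

Under these assumptions, `/story/example` and `/ckeditor/ckeditor.js` are disallowed, while `/` and `/dashboard` (no trailing slash) remain allowed, because `/dashboard/*` only matches paths that continue past the slash. A parser that applies rules in file order instead would stop at `Allow | /` and permit every path, so the precedence model matters for this file.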
Other Records
Field | Value |
---|---|
sitemap | https://beforeitsnews.com/sitemap.xml |