5lejnews.com
robots.txt
Robots Exclusion Standard data for 5lejnews.com
Resource Scan
Scan Details
Site Domain | 5lejnews.com |
Base Domain | 5lejnews.com |
Scan Status | Ok |
Last Scan | 2025-05-05T05:02:58+00:00 |
Next Scan | 2025-05-12T05:02:58+00:00 |
Last Scan
Scanned | 2025-05-05T05:02:58+00:00 |
URL | https://5lejnews.com/robots.txt |
Redirect | https://news.5lejnews.com/robots.txt |
Redirect Domain | news.5lejnews.com |
Redirect Base | 5lejnews.com |
Domain IPs | 104.21.77.121, 172.67.207.170, 2606:4700:3034::6815:4d79, 2606:4700:3036::ac43:cfaa |
Redirect IPs | 104.21.77.121, 172.67.207.170, 2606:4700:3034::6815:4d79, 2606:4700:3036::ac43:cfaa |
Response IP | 104.21.77.121 |
Found | Yes |
Hash | f590df534fdcde44edad733380de57529ed08315010e86aa16ac119ea8f03298 |
SimHash | ed3998007f51 |
Groups
*
Rule | Path |
---|---|
Disallow | /panel |
Disallow | /cron |
Disallow | /ajax |
Disallow | /widgets_factory |
Disallow | /auth |
Disallow | /login |
Disallow | /register |
Disallow | /style |
Disallow | /printit |
Disallow | /emailthis |
Disallow | /outside |
Other Records
Field | Value |
---|---|
sitemap | https://news.5lejnews.com/sitemap.xml |