newstral.com
robots.txt
Robots Exclusion Standard data for newstral.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | newstral.com |
Base Domain | newstral.com |
Scan Status | Ok |
Last Scan | 2024-11-18T10:49:52+00:00 |
Next Scan | 2024-11-25T10:49:52+00:00 |
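The gap between the two timestamps above can be checked with a short calculation. This is only a sketch: the variable names are illustrative, and the idea that scans recur on a fixed weekly schedule is an inference from these two values, not something the report states.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details rows above (ISO 8601, UTC offset +00:00).
last_scan = datetime.fromisoformat("2024-11-18T10:49:52+00:00")
next_scan = datetime.fromisoformat("2024-11-25T10:49:52+00:00")

# Difference between the scheduled next scan and the last completed scan.
interval = next_scan - last_scan
print(interval)  # 7 days, 0:00:00
```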
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-18T10:49:52+00:00 |
URL | https://newstral.com/robots.txt |
Domain IPs | 138.201.137.196 |
Response IP | 138.201.137.196 |
Found | Yes |
Hash | 7367b52450c03c802d1294e44791ab156683faa08ea62f0a662ce2616aea76e6 |
SimHash | 321c7d6de572 |
Groups
*
Rule | Path |
---|---|
Disallow | /nl/maps |
Disallow | /nl/regions |
Disallow | /nl/people |
Disallow | /nl/organisations |
Disallow | /en/cars |
Disallow | /es/cars |
Disallow | /nl/cars |
Disallow | /de/article/en |
Disallow | /de/article/es |
Disallow | /en/article/de |
Disallow | /en/article/es |
Disallow | /en/article/nl |
Disallow | /es/article/en |
Disallow | /es/article/de |
Disallow | /es/article/nl |
Disallow | /nl/article/en |
Disallow | /nl/article/es |
Disallow | /nl/article/de |
Disallow | /sources/1 |
Disallow | /sources/2 |
Disallow | /sources/3 |
Disallow | /sources/4 |
Disallow | /sources/5 |
Disallow | /sources/6 |
Disallow | /sources/7 |
Disallow | /sources/8 |
Disallow | /sources/9 |
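As a rough illustration of how a crawler might apply the `*` group above, the following sketch feeds a small subset of the listed rules to Python's standard `urllib.robotparser`. The helper `is_allowed`, the agent name `ExampleBot`, and the test paths are hypothetical and not taken from the scan. Note that this parser matches rules by path prefix, so under that interpretation `/sources/1` also covers paths such as `/sources/12`.

```python
from urllib.robotparser import RobotFileParser

# A subset of the rules from the "*" group above, written in robots.txt syntax.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /nl/maps",
    "Disallow: /en/cars",
    "Disallow: /en/article/de",
    "Disallow: /sources/1",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)  # parse in-memory lines instead of fetching over HTTP


def is_allowed(path, agent="ExampleBot"):
    """Return True if `agent` may fetch `path` under the parsed rules.

    `agent` and the paths passed in are hypothetical examples.
    """
    return parser.can_fetch(agent, "https://newstral.com" + path)


print(is_allowed("/sources/12"))         # blocked: prefix match on /sources/1
print(is_allowed("/en/article/de/123"))  # blocked: prefix match on /en/article/de
print(is_allowed("/en/news"))            # not covered by any rule in this subset
```

Other crawlers may implement matching differently (for example with wildcard support), so the results here describe only the standard-library parser's behavior.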
Other Records
Field | Value |
---|---|
sitemap | https://newstral.com/sitemap_index.xml |
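The sitemap record can be read with the same standard-library parser. The sketch below assumes Python 3.8 or later, where `RobotFileParser.site_maps()` is available, and parses a hand-written excerpt rather than the live file; the single `Disallow` line is included only to make the excerpt look like a realistic robots.txt.

```python
from urllib.robotparser import RobotFileParser

# Hand-written excerpt: one rule from the "*" group plus the sitemap record above.
parser = RobotFileParser()
parser.parse([
    "User-agent: *",
    "Disallow: /nl/maps",
    "Sitemap: https://newstral.com/sitemap_index.xml",
])

# site_maps() returns a list of sitemap URLs, or None if the file declares none.
sitemaps = parser.site_maps()
print(sitemaps)
```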
Warnings
- 2 invalid lines.