infoclm.es
robots.txt
Robots Exclusion Standard data for infoclm.es
Resource Scan
Scan Details
Site Domain | infoclm.es |
Base Domain | infoclm.es |
Scan Status | Ok |
Last Scan | 2024-10-03T11:09:42+00:00 |
Next Scan | 2024-10-10T11:09:42+00:00 |
Last Scan
Scanned | 2024-10-03T11:09:42+00:00 |
URL | https://infoclm.es/robots.txt |
Redirect | https://www.infoclm.es/robots.txt |
Redirect Domain | www.infoclm.es |
Redirect Base | infoclm.es |
Domain IPs | 51.255.17.77 |
Redirect IPs | 51.255.17.77 |
Response IP | 51.255.17.77 |
Found | Yes |
Hash | 8451ed2aef66ce6212b3ed5b1fe66d40a6b915c6db5b180170eba6b94a39b944 |
SimHash | 4808b8508812 |
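The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. A minimal sketch of how such a fingerprint could be used to detect changes between scans, assuming SHA-256 over the raw response body (the function names `content_hash` and `has_changed` are hypothetical, and the scanner may normalize the body before hashing):

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes; this is not confirmed
    by the report.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against the hash from the last scan."""
    return content_hash(body) != previous_hash.lower()
```

Unlike this exact hash, the SimHash field is presumably a similarity fingerprint, so small edits to the file would change the Hash completely while leaving the SimHash close to its previous value.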
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | */cdn-cgi/* |
Disallow | */cdn-fpw/* |
Allow | /feed/$ |
Disallow | */feed/ |
Disallow | /2021/* |
Disallow | /2020/* |
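The rules above combine `*` wildcards, a `$` end anchor, and an Allow that overlaps a Disallow, so it may help to see how a crawler could evaluate them. The following is a rough sketch of longest-match evaluation in the style of RFC 9309, not the scanner's own logic; the helper names `pattern_to_regex` and `is_allowed` are hypothetical, and using the pattern length as the measure of specificity is an assumption that individual crawlers may implement differently.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "*/cdn-cgi/*"),
    ("disallow", "*/cdn-fpw/*"),
    ("allow", "/feed/$"),
    ("disallow", "*/feed/"),
    ("disallow", "/2021/*"),
    ("disallow", "/2020/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under the given rules.

    The longest matching pattern wins; on a tie, Allow wins.
    A path that matches no rule is allowed.
    """
    best_len, best_kind = -1, "allow"
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, best_kind = length, kind
    return best_kind == "allow"
```

Under these assumptions, `is_allowed("/feed/")` is true because `/feed/$` ties with `*/feed/` and Allow wins the tie, while a nested feed such as `/category/news/feed/` is blocked, as is anything under `/2020/` or `/2021/`.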
Other Records
Field | Value |
---|---|
crawl-delay | 30 |
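The `crawl-delay` field is not part of RFC 9309, and some major crawlers ignore it, but crawlers that honor it typically read the value as a minimum number of seconds between requests. A minimal sketch of that interpretation, assuming the value is in seconds (the class `PoliteScheduler` and its methods are hypothetical names for illustration):

```python
import time

CRAWL_DELAY_SECONDS = 30  # value from the crawl-delay record above


class PoliteScheduler:
    """Track when the next request to the site may be sent."""

    def __init__(self, delay=CRAWL_DELAY_SECONDS, clock=time.monotonic):
        self.delay = delay
        self.clock = clock  # injectable so the logic can be tested without sleeping
        self.last_request = None

    def seconds_to_wait(self):
        """Return how long to pause before the next request (0.0 if none)."""
        if self.last_request is None:
            return 0.0
        elapsed = self.clock() - self.last_request
        return max(0.0, self.delay - elapsed)

    def mark_request(self):
        """Record that a request was just sent."""
        self.last_request = self.clock()
```

A crawler would call `time.sleep(scheduler.seconds_to_wait())` before each fetch and `scheduler.mark_request()` afterwards. At 30 seconds per request, this caps a single crawler at 2,880 requests per day.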
Other Records
Field | Value |
---|---|
sitemap | https://www.infoclm.es/news-sitemap.xml |