nyheter24.se
robots.txt

Robots Exclusion Standard data for nyheter24.se

Resource Scan

Scan Details

Site Domain nyheter24.se
Base Domain nyheter24.se
Scan Status Ok
Last Scan2024-04-22T21:18:25+00:00
Next Scan 2024-04-29T21:18:25+00:00

Last Scan

Scanned2024-04-22T21:18:25+00:00
URL https://nyheter24.se/robots.txt
Domain IPs 104.26.2.215, 104.26.3.215, 172.67.68.234, 2606:4700:20::681a:2d7, 2606:4700:20::681a:3d7, 2606:4700:20::ac43:44ea
Response IP 172.67.68.234
Found Yes
Hash c4943e824f4f493cd684f7710d86750034fff5f0bcd5c03e960d3987713b13b5
SimHash 315104554f81

Groups

*

Rule Path
Disallow /ajax/
Disallow /plugin/
Disallow /404/
Disallow /410/

grapeshot

Rule Path
Disallow

Other Records

Field Value
sitemap https://nyheter24.se/sitemap.xml
sitemap https://news.nyheter24.se

Comments

  • Allow the Grapeshot crawler full access