granditalia.nl
robots.txt

Robots Exclusion Standard data for granditalia.nl

Resource Scan

Scan Details

Site Domain granditalia.nl
Base Domain granditalia.nl
Scan Status Ok
Last Scan 2024-10-23T19:22:25+00:00
Next Scan 2024-11-22T19:22:25+00:00

Last Scan

Scanned 2024-10-23T19:22:25+00:00
URL https://granditalia.nl/robots.txt
Redirect https://www.granditalia.nl/robots.txt
Redirect Domain www.granditalia.nl
Redirect Base granditalia.nl
Domain IPs 2a00:1a48:7903:100:78c2:8e68:0:3, 52.233.128.61
Redirect IPs 152.199.39.108, 2606:2800:247:1cb7:261b:1f9c:2074:3c
Response IP 152.199.39.108
Found Yes
Hash 9e7ee5d2c3558f6f4ae54fdb5bdafeca8a810ad65b4642c561e47508305344a9
SimHash 550cbc479eda

Groups

sogou*

Rule Path
Disallow /
Disallow /core/
Disallow /profiles/
Disallow /README.txt
Disallow /web.config
Disallow /ads.txt
Disallow /index.php/
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips
Disallow /node/add/
Disallow /node/
Disallow /search/
Disallow /user/register/
Disallow /user/password/
Disallow /user/login/
Disallow /user/logout/
Disallow /index.php/admin/
Disallow /index.php/comment/reply/
Disallow /index.php/filter/tips
Disallow /index.php/node/add/
Disallow /index.php/node/
Disallow /index.php/search/
Disallow /index.php/user/password/
Disallow /index.php/user/register/
Disallow /index.php/user/login/
Disallow /index.php/user/logout/
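
The effect of the rule list above can be illustrated with a minimal sketch of robots.txt path matching. It assumes plain prefix matching with the longest matching rule winning (the behaviour described in RFC 9309) and ignores wildcards. The function name `is_allowed` and the sample paths are hypothetical, and only a subset of the listed rules is copied in. Under these assumptions, the `Disallow /` entry already blocks every path for this group, so the more specific rules only matter if that entry is removed.

```python
# Subset of the Disallow paths listed in the group above.
DISALLOW = ["/", "/core/", "/admin/", "/node/", "/index.php/node/"]


def is_allowed(path, disallow=DISALLOW, allow=()):
    """Return True if `path` may be crawled.

    Simplified sketch: plain prefix matching, longest rule wins,
    Allow wins a tie, and a path with no matching rule is allowed.
    Wildcards (*) and end anchors ($) are not handled.
    """
    best_len, allowed = -1, True
    for rule_allows, rules in ((False, disallow), (True, allow)):
        for rule in rules:
            if not rule or not path.startswith(rule):
                continue
            longer = len(rule) > best_len
            tie_for_allow = len(rule) == best_len and rule_allows
            if longer or tie_for_allow:
                best_len, allowed = len(rule), rule_allows
    return allowed


# With "Disallow /" present, any path is blocked.
print(is_allowed("/some/page"))
# Without it, only the specific prefixes are blocked.
print(is_allowed("/some/page", disallow=["/core/", "/node/"]))
print(is_allowed("/node/5", disallow=["/core/", "/node/"]))
```

A real crawler would normally rely on an existing parser rather than this sketch. The point here is only to show how the listed prefixes interact.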

Other Records

Field Value
crawl-delay 10
sitemap https://www.granditalia.nl/sitemap.xml
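
As a rough sketch, the crawl-delay and sitemap records can be read back with Python's standard `urllib.robotparser`. The robots.txt text below is a reconstruction for illustration, not the site's actual file: in particular, the `User-agent: *` line and the crawler name `ExampleBot` are assumptions. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the relevant records; the User-agent
# line is an assumption, the two values come from the tables above.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 10

Sitemap: https://www.granditalia.nl/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Seconds a polite crawler should wait between requests.
delay = parser.crawl_delay("ExampleBot")
# List of sitemap URLs declared in the file (None if there are none).
sitemaps = parser.site_maps()

print(delay)
print(sitemaps)
```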

Comments

  • Robots1
  • Directories
  • Files
  • Paths (clean URLs)
  • Paths (no clean URLs)