Robots Exclusion Standard data for slashdigit.com

Resource Scan

Scan Details

Site Domain slashdigit.com
Base Domain slashdigit.com
Scan Status Ok
Last Scan 2024-09-21T11:23:42+00:00
Next Scan 2024-09-28T11:23:42+00:00

Last Scan

Scanned 2024-09-21T11:23:42+00:00
URL https://slashdigit.com/robots.txt
Domain IPs 139.162.7.47
Response IP 139.162.7.47
Found Yes
Hash da08fb0f2ab73e3b36d74dce54c751ccf5a0d001a599007811334c9269d2dd48
SimHash e8104bacf2b1

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /comments/feed/
Disallow /trackback/
Disallow /index.php
Disallow /xmlrpc.php

slurp

Rule Path
Disallow /

ninjabot

Rule Path
Allow /

mediapartners-google*

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

googlebot-news

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /
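The groups above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser, not the method used by this scan. The robots.txt text is reconstructed from the listed groups (only three of them, for brevity); the file's exact wording, its group order, and its 2 invalid lines are not shown in this report, so the reconstruction is an assumption. The helper is_allowed and the crawler name "examplebot" are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the groups in this report. The real file's
# exact text, ordering, and invalid lines are not available here.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /wp-admin/
Disallow: /wp-login.php
Disallow: /comments/feed/
Disallow: /trackback/
Disallow: /index.php
Disallow: /xmlrpc.php

User-agent: slurp
Disallow: /

User-agent: googlebot-image
Allow: /wp-content/uploads/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: True if `agent` may fetch `path` on this site."""
    return parser.can_fetch(agent, "https://slashdigit.com" + path)


if __name__ == "__main__":
    # "examplebot" is a made-up crawler that falls under the "*" group.
    for agent, path in [
        ("examplebot", "/wp-admin/"),
        ("examplebot", "/a-post/"),
        ("slurp", "/a-post/"),
        ("googlebot-image", "/wp-content/uploads/a.png"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

A crawler that matches a named group follows that group instead of the "*" group, so slurp is blocked from the whole site while crawlers without a group of their own are only kept out of the seven listed paths. urllib.robotparser applies the first matching rule and does simple prefix matching, which can differ from how individual crawlers interpret the same file.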

Other Records

Field Value
sitemap https://www.slashdigit.com/sitemap.xml

Warnings

  • 2 invalid lines.