twinsintowblog.com
robots.txt

Robots Exclusion Standard data for twinsintowblog.com

Resource Scan

Scan Details

Site Domain twinsintowblog.com
Base Domain twinsintowblog.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-08-22T02:42:25+00:00
Next Scan 2024-11-20T02:42:25+00:00

Last Successful Scan

Scanned2024-01-03T01:11:28+00:00
URL https://twinsintowblog.com/robots.txt
Domain IPs 192.0.78.175, 192.0.78.251
Response IP 192.0.78.251
Found Yes
Hash 541da03c29678dc8cc093a4cab70ca15f237b2602cfca46db88d8e6d5b04609a
SimHash 4c440ac0d593

Groups

scrapy

Rule Path
Allow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap /sitemap.xml
sitemap /news-sitemap.xml
sitemap /sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK