themediacaptain.com
robots.txt

Robots Exclusion Standard data for themediacaptain.com

Resource Scan

Scan Details

Site Domain themediacaptain.com
Base Domain themediacaptain.com
Scan Status Ok
Last Scan 2025-04-17T01:00:33+00:00
Next Scan 2025-05-17T01:00:33+00:00

Last Scan

Scanned 2025-04-17T01:00:33+00:00
URL https://themediacaptain.com/robots.txt
Redirect http://www.themediacaptain.com/robots.txt
Redirect Domain www.themediacaptain.com
Redirect Base themediacaptain.com
Domain IPs 104.21.62.210, 172.67.139.67, 2606:4700:3031::ac43:8b43, 2606:4700:3034::6815:3ed2
Redirect IPs 104.21.62.210, 172.67.139.67, 2606:4700:3031::ac43:8b43, 2606:4700:3034::6815:3ed2
Response IP 172.67.139.67
Found Yes
Hash 8264d3c7b0be704bf856fb3e8f3261ecbc30901dd7e198b1888604ee6e3d38ab
SimHash 694cd8d0e093

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow (empty)

Other Records

Field Value
sitemap https://www.themediacaptain.com/sitemap_index.xml
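As a rough illustration of how a crawler might interpret the groups and records above, the sketch below feeds them to Python's standard urllib.robotparser. The robots.txt text is an assumption: it is rebuilt only from the records this scan lists, so it omits the comment lines and the two lines flagged as invalid. The crawler name ExampleBot and the path /any/path are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a minimal robots.txt rebuilt from the records listed above.
# The live file also holds comment lines and two lines the scanner flagged
# as invalid; those are not reproduced here.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 10

User-agent: *
Disallow:

Sitemap: https://www.themediacaptain.com/sitemap_index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Hypothetical crawler name and path, used only for illustration.
allowed = parser.can_fetch("ExampleBot", "https://www.themediacaptain.com/any/path")
delay = parser.crawl_delay("ExampleBot")
sitemaps = parser.site_maps()  # site_maps() requires Python 3.8 or later

print(allowed, delay, sitemaps)
```

Under these assumptions every path is fetchable, the delay is 10 seconds, and one sitemap is reported. Python's parser keeps only the first * group as its default entry; other parsers may merge the two groups, which gives the same result here because the second group disallows nothing.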

Comments

  • START WPFORMS BLOCK
  • ---------------------------
  • ---------------------------
  • END WPFORMS BLOCK
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK

Warnings

  • 2 invalid lines.