themediacaptain.com
robots.txt
Robots Exclusion Standard data for themediacaptain.com
Resource Scan
Scan Details
Site Domain | themediacaptain.com |
Base Domain | themediacaptain.com |
Scan Status | Ok |
Last Scan | 2025-04-17T01:00:33+00:00 |
Next Scan | 2025-05-17T01:00:33+00:00 |
Last Scan
Scanned | 2025-04-17T01:00:33+00:00 |
URL | https://themediacaptain.com/robots.txt |
Redirect | http://www.themediacaptain.com/robots.txt |
Redirect Domain | www.themediacaptain.com |
Redirect Base | themediacaptain.com |
Domain IPs | 104.21.62.210, 172.67.139.67, 2606:4700:3031::ac43:8b43, 2606:4700:3034::6815:3ed2 |
Redirect IPs | 104.21.62.210, 172.67.139.67, 2606:4700:3031::ac43:8b43, 2606:4700:3034::6815:3ed2 |
Response IP | 172.67.139.67 |
Found | Yes |
Hash | 8264d3c7b0be704bf856fb3e8f3261ecbc30901dd7e198b1888604ee6e3d38ab |
SimHash | 694cd8d0e093 |
Groups
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
*
Rule | Path |
---|---|
Disallow |
Other Records
Field | Value |
---|---|
sitemap | https://www.themediacaptain.com/sitemap_index.xml |
Warnings
- 2 invalid lines.
Comments