webshark.in
robots.txt
Robots Exclusion Standard data for webshark.in
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | webshark.in |
Base Domain | webshark.in |
Scan Status | Ok |
Last Scan | 2025-09-15T13:45:41+00:00 |
Next Scan | 2025-10-15T13:45:41+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-09-15T13:45:41+00:00 |
URL | https://webshark.in/robots.txt |
Redirect | https://www.webshark.in/robots.txt |
Redirect Domain | www.webshark.in |
Redirect Base | webshark.in |
Domain IPs | 143.244.130.225 |
Redirect IPs | 104.21.73.59, 172.67.158.71, 2606:4700:3032::6815:493b, 2606:4700:3032::ac43:9e47 |
Response IP | 104.21.73.59 |
Found | Yes |
Hash | a680bd6c36dbd3a47df1432cadca22d6bfe75ee3d2a24aa0f58caadfe821f774 |
SimHash | 500e51546d55 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /wp-json/ |
Disallow | /wp-includes/ |
Disallow | /includes/ |
Disallow | /elementor-heading/ |
Disallow | /contacthandler/ |
Disallow | /comments/ |
Disallow | /cgi-bin/ |
Disallow | /category/ |
Disallow | /author/ |
Disallow | /assests/ |
Disallow | /about-us-1/ |
Disallow | /about-us-2/ |
Disallow | /2018/ |
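
To illustrate how a crawler might apply the rules in this group, here is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text is reconstructed from the table above and is not the file as served; the user agent name "ExampleBot" and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan table above (an assumption: the served
# file may differ in ordering, comments, or whitespace).
GROUP_TEXT = """\
User-agent: *
Disallow: /wp-json/
Disallow: /wp-includes/
Disallow: /includes/
Disallow: /elementor-heading/
Disallow: /contacthandler/
Disallow: /comments/
Disallow: /cgi-bin/
Disallow: /category/
Disallow: /author/
Disallow: /assests/
Disallow: /about-us-1/
Disallow: /about-us-2/
Disallow: /2018/
"""

BASE = "https://www.webshark.in"

rules = RobotFileParser()
rules.parse(GROUP_TEXT.splitlines())


def allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the hypothetical agent may fetch BASE + path."""
    return rules.can_fetch(agent, BASE + path)


# Hypothetical sample paths, chosen only to exercise the prefix matching.
for sample in ("/wp-json/wp/v2/posts", "/2018/05/some-post/",
               "/assests/logo.png", "/assets/logo.png", "/"):
    print(sample, "allowed" if allowed(sample) else "disallowed")
```

Matching is by path prefix, so the rule spelled /assests/ does not cover paths under /assets/. If /assets/ was the intended directory (an assumption; the scan does not say), those paths remain crawlable under this group.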
Other Records
Field | Value |
---|---|
sitemap | https://www.webshark.in/sitemap-may2025.xml |
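
The sitemap record sits outside any user-agent group, so a consumer would typically read it separately from the Disallow rules. The sketch below shows one way to extract it with urllib.robotparser (site_maps() requires Python 3.8 or later). The input text is an abbreviated reconstruction from this report, not the file as served.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction (an assumption): one rule from the "*" group
# plus the sitemap record listed under Other Records.
RECORDS_TEXT = """\
User-agent: *
Disallow: /wp-json/
Sitemap: https://www.webshark.in/sitemap-may2025.xml
"""

record_parser = RobotFileParser()
record_parser.parse(RECORDS_TEXT.splitlines())

# site_maps() returns None when no Sitemap lines are present,
# so fall back to an empty list.
sitemap_urls = record_parser.site_maps() or []

for sitemap_url in sitemap_urls:
    print(sitemap_url)
```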