webshark.in
robots.txt

Robots Exclusion Standard data for webshark.in

Resource Scan

Scan Details

Site Domain webshark.in
Base Domain webshark.in
Scan Status Ok
Last Scan2025-09-15T13:45:41+00:00
Next Scan 2025-10-15T13:45:41+00:00

Last Scan

Scanned2025-09-15T13:45:41+00:00
URL https://webshark.in/robots.txt
Redirect https://www.webshark.in/robots.txt
Redirect Domain www.webshark.in
Redirect Base webshark.in
Domain IPs 143.244.130.225
Redirect IPs 104.21.73.59, 172.67.158.71, 2606:4700:3032::6815:493b, 2606:4700:3032::ac43:9e47
Response IP 104.21.73.59
Found Yes
Hash a680bd6c36dbd3a47df1432cadca22d6bfe75ee3d2a24aa0f58caadfe821f774
SimHash 500e51546d55

Groups

*

Rule Path
Disallow /wp-json/
Disallow /wp-includes/
Disallow /includes/
Disallow /elementor-heading/
Disallow /contacthandler/
Disallow /comments/
Disallow /cgi-bin/
Disallow /category/
Disallow /author/
Disallow /assests/
Disallow /about-us-1/
Disallow /about-us-2/
Disallow /2018/

Other Records

Field Value
sitemap https://www.webshark.in/sitemap-may2025.xml