mahasarkar.co.in
robots.txt
Robots Exclusion Standard data for mahasarkar.co.in
Resource Scan
Scan Details
Site Domain | mahasarkar.co.in |
Base Domain | mahasarkar.co.in |
Scan Status | Ok |
Last Scan | 2024-11-12T07:21:22+00:00 |
Next Scan | 2024-11-19T07:21:22+00:00 |
Last Scan
Scanned | 2024-11-12T07:21:22+00:00 |
URL | https://mahasarkar.co.in/robots.txt |
Domain IPs | 104.21.88.184, 172.67.151.196, 2606:4700:3030::6815:58b8, 2606:4700:3036::ac43:97c4 |
Response IP | 104.21.88.184 |
Found | Yes |
Hash | db6e38a5fdacff6e8aad4c716b8bc51392797d6d341b5c0260a4b9424293393f |
SimHash | e660188169f1 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
Disallow | /detroitchicago/ |
Disallow | /porpoiseant/ |
Disallow | /tag/ |
Disallow | /parsonsmaize/ |
Disallow | /beardeddragon/ |
Disallow | /ezais/ |
Disallow | /tardisrocinante/ |
Disallow | /*?expand_article= |
Disallow | /*?page107854= |
Disallow | /edmontonalberta/ |
Disallow | /*?fbclid= |
Disallow | /*.pdf$ |
Other Records
Field | Value |
---|---|
sitemap | https://mahasarkar.co.in/sitemap_index.xml |