intaakmedia.com
robots.txt

Robots Exclusion Standard data for intaakmedia.com

Resource Scan

Scan Details

Site Domain intaakmedia.com
Base Domain intaakmedia.com
Scan Status Ok
Last Scan 2026-04-01T20:12:51+00:00
Next Scan 2026-04-08T20:12:51+00:00

Last Scan

Scanned 2026-04-01T20:12:51+00:00
URL https://intaakmedia.com/robots.txt
Domain IPs 2.57.91.235, 2a02:4780:84:10c4:f01e:8197:a59f:65f0, 2a02:4780:84:6ae9:43e4:b900:2350:47be, 88.222.222.248
Response IP 77.37.75.78
Found Yes
Hash 0e0a39f2cbef8140e6da6f1fc9d4aab61ad9e7fa8073e9cc42a761d2e91fac82
SimHash f201c9234cb4

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /mobiles/
Disallow /*?share
Disallow /*?wc-ajax
Disallow /wp-json/
Disallow /*?add-to-cart=
Disallow /wp-content/uploads/wc-logs/
Disallow /wp-content/uploads/woocommerce_transient_files/
Disallow /wp-content/uploads/woocommerce_uploads/
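
The Allow for /wp-admin/admin-ajax.php only takes effect if the crawler applies longest-match precedence rather than first-match, and several rules rely on the `*` wildcard. The sketch below shows one way to evaluate the rules of the `*` group under those assumptions (longest matching pattern wins, Allow wins a tie, no match means allowed). The names `STAR_GROUP`, `pattern_to_regex` and `is_allowed` are hypothetical helpers written for this illustration; they are not part of the scan data or of any crawler.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
STAR_GROUP = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/mobiles/"),
    ("disallow", "/*?share"),
    ("disallow", "/*?wc-ajax"),
    ("disallow", "/wp-json/"),
    ("disallow", "/*?add-to-cart="),
    ("disallow", "/wp-content/uploads/wc-logs/"),
    ("disallow", "/wp-content/uploads/woocommerce_transient_files/"),
    ("disallow", "/wp-content/uploads/woocommerce_uploads/"),
]


def pattern_to_regex(pattern):
    """Translate a path pattern: '*' matches any run of characters,
    a trailing '$' anchors the end; everything else is literal."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=STAR_GROUP):
    """Assumed precedence: the longest matching pattern wins,
    Allow wins a tie, and a path with no matching rule is allowed."""
    best_len, allowed = -1, True
    for directive, pattern in rules:
        # re.match anchors at the start of the path, like a prefix rule.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and directive == "allow"):
                best_len, allowed = length, directive == "allow"
    return allowed
```

Under these assumptions, `is_allowed("/wp-admin/admin-ajax.php")` is true while `is_allowed("/wp-admin/options.php")` is false, because the Allow pattern is longer than the Disallow pattern that also matches. A first-match parser would read the same group differently, so results can vary between crawlers.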

pinterestbot

Rule Path
Allow /feed/
Allow */feed/
Allow /category/*/feed/

flipboard

Rule Path
Allow /feed/

flipboardproxy

Rule Path
Allow /feed/
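
The report lists four groups, and a crawler normally obeys only the one that matches its own user-agent token, falling back to `*` when none does. A minimal sketch of that selection follows, assuming an exact, case-insensitive match on the product token (real crawlers may match more loosely). `GROUP_TOKENS` and `select_group` are hypothetical names used only for this illustration.

```python
# User-agent tokens of the groups listed in this report.
GROUP_TOKENS = ["*", "pinterestbot", "flipboard", "flipboardproxy"]


def select_group(product_token, tokens=GROUP_TOKENS):
    """Return the group a crawler would follow: an exact,
    case-insensitive token match if one exists, otherwise '*'."""
    wanted = product_token.lower()
    for token in tokens:
        if token != "*" and token.lower() == wanted:
            return token
    # No named group matched: fall back to the wildcard group if present.
    return "*" if "*" in tokens else None
```

With this reading, `select_group("Pinterestbot")` returns `"pinterestbot"` and `select_group("Googlebot")` returns `"*"`. Since the named groups here contain only Allow rules, a crawler that selects one of them would not be bound by the Disallow rules of the `*` group.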

Other Records

Field Value
sitemap https://intaakmedia.com/sitemap_index.xml