Robots Exclusion Standard data for integrateideas.com

Resource Scan

Scan Details

Site Domain integrateideas.com
Base Domain integrateideas.com
Scan Status Ok
Last Scan 2025-12-06T03:54:01+00:00
Next Scan 2026-01-05T03:54:01+00:00

Last Scan

Scanned 2025-12-06T03:54:01+00:00
URL https://integrateideas.com/robots.txt
Domain IPs 199.16.172.143
Response IP 199.16.172.143
Found Yes
Hash 4925334f4e8d9d3d5bbb9a006f729984a2a4547da089d3f9fc0b957d18d542e3
SimHash c959ca61d292
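
A fetched copy of the file could be compared against the Hash value above. The sketch below assumes the hash is a SHA-256 digest of the raw response body. The report does not name the algorithm; the 64 hex digits are merely consistent with SHA-256. The helper names `sha256_hex` and `matches_report` are hypothetical.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "4925334f4e8d9d3d5bbb9a006f729984a2a4547da089d3f9fc0b957d18d542e3"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a robots.txt body against the reported hash.

    ASSUMPTION: the report hashes the unmodified response body with SHA-256.
    A mismatch may only mean the scanner used a different algorithm or
    normalized the file before hashing.
    """
    return sha256_hex(body) == reported.lower()
```

To use it, pass the bytes of a downloaded robots.txt to `matches_report`; a `False` result does not by itself prove that the file changed.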

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /trackback/
Disallow /feed/
Disallow /comments/
Disallow /category/*/*
Disallow */trackback/
Disallow */feed/
Disallow */comments/
Disallow /test/
Disallow /plesk-stat/

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

Other Records

Field Value
sitemap https://integrateideas.com/sitemap_index.xml
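
The groups above can be checked against sample paths with Python's standard `urllib.robotparser`. This is a minimal sketch: `ROBOTS_TXT` is a reconstruction from the parsed groups in this report, not the raw file, so directive order, spacing, and user-agent capitalization are assumptions. `ExampleBot` and the helper `allowed` are hypothetical names. The standard-library parser matches rules by plain prefix and does not expand `*` inside paths, so the wildcard rules are listed for completeness but not exercised.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: rebuilt from the report's parsed groups; the real file may differ
# in ordering, comments, and capitalization.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /wp-admin/
Disallow: /trackback/
Disallow: /feed/
Disallow: /comments/
Disallow: /category/*/*
Disallow: */trackback/
Disallow: */feed/
Disallow: */comments/
Disallow: /test/
Disallow: /plesk-stat/

User-agent: mediapartners-google
Allow: /

User-agent: adsbot-google
Allow: /

User-agent: googlebot-image
Allow: /

User-agent: googlebot-mobile
Allow: /

Sitemap: https://integrateideas.com/sitemap_index.xml
"""

BASE = "https://integrateideas.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the reconstructed rules."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # A crawler with no group of its own falls back to the "*" group.
    print(allowed("ExampleBot", "/wp-admin/"))
    print(allowed("ExampleBot", "/about/"))
    # Crawlers with their own group are governed by "Allow: /".
    print(allowed("Googlebot-Image", "/wp-admin/"))
    print(parser.site_maps())
```

A crawler that needs Google-style wildcard matching for rules such as `/category/*/*` would need a parser that implements it; the standard-library parser is used here only because it is self-contained.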