indyapages.com
robots.txt

Robots Exclusion Standard data for indyapages.com

Resource Scan

Scan Details

Site Domain indyapages.com
Base Domain indyapages.com
Scan Status Ok
Last Scan 2024-09-30T17:28:56+00:00
Next Scan 2024-10-07T17:28:56+00:00

Last Scan

Scanned 2024-09-30T17:28:56+00:00
URL http://indyapages.com/robots.txt
Redirect http://www.indyapages.com/robots.txt
Redirect Domain www.indyapages.com
Redirect Base indyapages.com
Domain IPs 66.147.238.51
Redirect IPs 66.147.238.51
Response IP 66.147.238.51
Found Yes
Hash 5b752eb8af4ee2140435d1fe07e8481188a3852c22cd5c261088d71fec05e3cb
SimHash eb4cc9d74597

Groups

facebookexternalhit

Rule Path
Disallow /api/
Allow /

*

Rule Path
Disallow /admin/
Disallow /include/
Disallow /about/contact
Disallow /inquiry
Disallow /checkout/*
Disallow /videos?q=*
Disallow /jobs?q=*
Disallow /login*
Disallow /getmatched
Disallow /pro/*
Disallow /join?claim=*
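
Several rules in the * group use wildcards. The following is a minimal sketch of how such patterns could be evaluated, assuming the common convention (described in RFC 9309) that * matches any run of characters, a trailing $ anchors the end, and a rule is compared against the start of the path plus query string. The names pattern_to_regex and is_disallowed are hypothetical helpers, not part of any crawler. Because this group contains only Disallow rules, any match means the URL is blocked.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOWED = [
    "/admin/", "/include/", "/about/contact", "/inquiry",
    "/checkout/*", "/videos?q=*", "/jobs?q=*", "/login*",
    "/getmatched", "/pro/*", "/join?claim=*",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path_and_query):
    """Return True if any Disallow pattern matches from the start."""
    return any(pattern_to_regex(p).match(path_and_query) for p in DISALLOWED)
```

For example, is_disallowed("/videos?q=cats") is True while is_disallowed("/videos") is False, since the rule only covers the search query form of that URL.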

googlebot

Rule Path
Allow /

mediapartners-google*

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

yahoo-mmcrawler

Rule Path
Allow /

yahoo-slurp

Rule Path
Allow /

slurp

Rule Path
Allow /

msnbot

Rule Path
Allow /

msnbot-media

Rule Path
Allow /

bing preview bot

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.indyapages.com/sitemap.xml
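
The groups and sitemap record above can be checked with Python's standard-library urllib.robotparser. This is a sketch under stated assumptions: the robots.txt text below is reconstructed from the listing (the original file is not shown, and the rule order is assumed to follow the listing), only three of the groups are included, and "SomeOtherBot" is a hypothetical crawler name. The standard-library parser does plain prefix matching, so the wildcard rules are left out here.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt (assumption): not the verbatim file from the site.
ROBOTS_TXT = """\
User-agent: facebookexternalhit
Disallow: /api/
Allow: /

User-agent: *
Disallow: /admin/
Disallow: /include/
Disallow: /inquiry

User-agent: googlebot
Allow: /

Sitemap: https://www.indyapages.com/sitemap.xml
"""

BASE = "http://www.indyapages.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent, path):
    """Return True if the given user agent may fetch BASE + path."""
    return parser.can_fetch(agent, BASE + path)


# A named group takes precedence over the "*" group for that crawler.
print(allowed("googlebot", "/admin/"))
print(allowed("SomeOtherBot", "/admin/"))
print(allowed("facebookexternalhit", "/api/users"))
print(parser.site_maps())  # site_maps() requires Python 3.8 or later
```

Under these assumptions, googlebot is not bound by the Disallow rules of the * group because it has its own group, while a crawler without a named group falls back to the * rules.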