infobloom.com
robots.txt

Robots Exclusion Standard data for infobloom.com

Resource Scan

Scan Details

Site Domain infobloom.com
Base Domain infobloom.com
Scan Status Ok
Last Scan2024-06-04T22:20:09+00:00
Next Scan 2024-06-11T22:20:09+00:00

Last Scan

Scanned2024-06-04T22:20:09+00:00
URL https://infobloom.com/robots.txt
Redirect https://www.infobloom.com/robots.txt
Redirect Domain www.infobloom.com
Redirect Base infobloom.com
Domain IPs 52.8.17.221, 52.8.5.233
Redirect IPs 108.157.254.108, 108.157.254.50, 108.157.254.81, 108.157.254.93, 2600:9000:2753:200:9:2198:cb00:93a1, 2600:9000:2753:4800:9:2198:cb00:93a1, 2600:9000:2753:600:9:2198:cb00:93a1, 2600:9000:2753:800:9:2198:cb00:93a1, 2600:9000:2753:b200:9:2198:cb00:93a1, 2600:9000:2753:be00:9:2198:cb00:93a1, 2600:9000:2753:c000:9:2198:cb00:93a1, 2600:9000:2753:c800:9:2198:cb00:93a1
Response IP 108.157.254.93
Found Yes
Hash 6c40f4952c835cc5ad480a14fe6c32c7970239621ffe15ba9e3da9bb4247d5ae
SimHash 0901d13f2313

Groups

*

Rule Path
Disallow /s/
Disallow /templates/
Disallow /d/
Disallow /related/
Disallow /relevant/
Disallow /videos/
Disallow /captcha.php
Disallow /*?expand_article
Disallow /*.js?cb=
Disallow /entertainment*

mediapartners-google

Rule Path
Allow /s/
Allow /related/
Allow /relevant/

Other Records

Field Value
sitemap https://www.infobloom.com/sitemap-infobloom.com-index.xml