pall.com
robots.txt

Robots Exclusion Standard data for pall.com

Resource Scan

Scan Details

Site Domain pall.com
Base Domain pall.com
Scan Status Ok
Last Scan2024-10-28T21:35:16+00:00
Next Scan 2024-11-27T21:35:16+00:00

Last Scan

Scanned2024-10-28T21:35:16+00:00
URL https://pall.com/robots.txt
Redirect https://www.pall.com/robots.txt
Redirect Domain www.pall.com
Redirect Base pall.com
Domain IPs 151.101.131.10, 151.101.195.10, 151.101.3.10, 151.101.67.10
Redirect IPs 151.101.131.10, 151.101.195.10, 151.101.3.10, 151.101.67.10
Response IP 199.232.47.10
Found Yes
Hash f957b7a7d7ee67fbd763aaf9754ab89ef70ae25e69f8dc36e5815d5db70fcd85
SimHash 611f89d56195

Groups

pallbot

Rule Path
Disallow

googlebot

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

baiduspider

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

bingbot

Rule Path
Disallow

msnbot

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

slurp

Rule Path
Disallow

gsa-crawler

Rule Path
Disallow

yandexbot

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

brightedge crawler

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

mj12bot

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

*

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://www.pall.com/sitemap.xml
sitemap https://www.pall.com/ar-sitemap.xml
sitemap https://www.pall.com/br-sitemap.xml
sitemap https://www.pall.com/de-sitemap.xml
sitemap https://www.pall.com/kr-sitemap.xml