pall.com
robots.txt

Robots Exclusion Standard data for pall.com

Resource Scan

Scan Details

Site Domain pall.com
Base Domain pall.com
Scan Status Ok
Last Scan2024-06-30T20:14:45+00:00
Next Scan 2024-07-30T20:14:45+00:00

Last Scan

Scanned2024-06-30T20:14:45+00:00
URL https://pall.com/robots.txt
Redirect https://www.pall.com:443/robots.txt
Redirect Domain www.pall.com
Redirect Base pall.com
Domain IPs 13.227.254.35, 13.227.254.43, 13.227.254.72, 13.227.254.77
Redirect IPs 13.227.254.35, 13.227.254.43, 13.227.254.72, 13.227.254.77
Response IP 13.227.254.43
Found Yes
Hash f957b7a7d7ee67fbd763aaf9754ab89ef70ae25e69f8dc36e5815d5db70fcd85
SimHash 611f89d56195

Groups

pallbot

Rule Path
Disallow

googlebot

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

baiduspider

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

bingbot

Rule Path
Disallow

msnbot

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

slurp

Rule Path
Disallow

gsa-crawler

Rule Path
Disallow

yandexbot

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

brightedge crawler

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

mj12bot

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

*

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://www.pall.com/sitemap.xml
sitemap https://www.pall.com/ar-sitemap.xml
sitemap https://www.pall.com/br-sitemap.xml
sitemap https://www.pall.com/de-sitemap.xml
sitemap https://www.pall.com/kr-sitemap.xml