plcsaigon.com
robots.txt

Robots Exclusion Standard data for plcsaigon.com

Resource Scan

Scan Details

Site Domain plcsaigon.com
Base Domain plcsaigon.com
Scan Status Ok
Last Scan2025-05-23T08:13:25+00:00
Next Scan 2025-06-22T08:13:25+00:00

Last Scan

Scanned2025-05-23T08:13:25+00:00
URL https://plcsaigon.com/robots.txt
Domain IPs 210.245.125.82
Response IP 210.245.125.82
Found Yes
Hash 6599f5b883398af50541af20fc5e4d3885c3d5b00e7e22304ae18351b0ca05b0
SimHash af15de4adcd0

Groups

*

Rule Path
Disallow /admin
Disallow /cart
Disallow /carts
Disallow /orders
Disallow /checkout
Disallow /checkouts
Disallow /account
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /search
Disallow /discount/*
Disallow /apple-app-site-association

adsbot-google

Rule Path
Disallow /checkout
Disallow /checkouts
Disallow /carts
Disallow /orders
Disallow /discount/*

nutch

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /admin
Disallow /cart
Disallow /carts
Disallow /orders
Disallow /checkout
Disallow /checkouts
Disallow /account
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /search
Disallow /discount/*
Disallow /apple-app-site-association

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

Rule Path
Disallow /admin
Disallow /cart
Disallow /carts
Disallow /orders
Disallow /checkout
Disallow /checkouts
Disallow /account
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /search
Disallow /discount/*
Disallow /apple-app-site-association

Other Records

Field Value
crawl-delay 10

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

pinterest

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.plcsaigon.com/sitemap.xml
sitemap https://www.plcsaigon.com/sitemap.xml
sitemap https://www.plcsaigon.com/sitemap.xml

Comments

  • we use Haravan as our ecommerce platform
  • Google adsbot ignores robots.txt unless specifically named!