kpmg.com
robots.txt

Robots Exclusion Standard data for kpmg.com

Resource Scan

Scan Details

Site Domain kpmg.com
Base Domain kpmg.com
Scan Status Ok
Last Scan2024-08-30T09:56:30+00:00
Next Scan 2024-09-29T09:56:30+00:00

Last Scan

Scanned2024-08-30T09:56:30+00:00
URL https://kpmg.com/robots.txt
Domain IPs 23.209.46.133, 23.209.46.150
Response IP 184.28.235.43
Found Yes
Hash c7359ce50aba3e6223808bd3288645b405b512cf16a7974f8419653561b5e238
SimHash 610e0197c5f9

Groups

cludo

Rule Path
Allow /

omtrbot/1.0

Rule Path
Disallow
Allow /

atomz/1.0

Rule Path
Disallow
Allow /

*

Rule Path
Disallow /*crawl.html
Disallow /*?keyword=
Disallow */search.html
Disallow /*.print.html
Disallow /*.model.json
Disallow /es/en/*
Disallow /fi/en/*
Disallow /tz/en/*
Disallow /rw/en/*
Disallow /ug/en/*
Disallow /is/en/*
Disallow /sl/en/*
Disallow /tc/en/*
Disallow /sz/en/*
Disallow /mw/en/*
Disallow /bw/en/*
Disallow /ao/en/*
Disallow /pa/en/*
Disallow /kg/en/*
Disallow /kg/ru/*
Disallow /us/ja/*
Disallow /dz/en/*
Disallow /tw/ja/*
Disallow /cg/fr/*
Disallow /content/dam/kpmg/au/pdf/creditors/*

youdaobot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://kpmg.com/sitemap-index.xml

Comments

  • Version 2024.05.20
  • home.kpmg