# file: robots.txt,v 1.0 2005/10/23 created by Sam # www.mychem.cn # 按照robots.txt的标准写法,规定一些不允许爬虫爬的页面或目录。 # robots.txt 的写法参照 # Format is: # User-agent: # Disallow: | # ----------------------------------------------------------------------------- User-agent: * Disallow: /Search/ User-agent: Sosospider Disallow: / User-agent: Sogou web spider Disallow: / User-agent: Slurp Crawl-delay: 60