{"id":8,"date":"2020-03-29T06:43:25","date_gmt":"2020-03-28T22:43:25","guid":{"rendered":"https:\/\/saveweb.othing.xyz\/iamwu555\/archives\/8"},"modified":"2021-06-06T00:05:38","modified_gmt":"2021-06-05T16:05:38","slug":"%e5%9b%bd%e5%ae%b6%e5%9c%b0%e7%90%86%e6%af%8f%e6%97%a5%e7%b2%be%e9%80%89%e7%88%ac%e5%8f%96","status":"publish","type":"post","link":"https:\/\/blog.save-web.org\/iamwu555\/archives\/8","title":{"rendered":"\u56fd\u5bb6\u5730\u7406\u6bcf\u65e5\u7cbe\u9009\u722c\u53d6"},"content":{"rendered":"<p>\u5728\u5237Jellow\u7684\u65f6\u5019\u770b\u89c1\u4e00\u4e2a\u4eba\u8bf4\u7b2c\u4e00\u6b21\u4ed8\u8d39\u5e94\u7528\uff0c\u56e0\u4e3a\u56fd\u5bb6\u5730\u7406\u6bcf\u65e5\u7cbe\u9009\u91cc\u7684\u56fe\u7247\u592a\u597d\u770b\uff0c\u6211\u5c31\u597d\u5947\u6709\u591a\u597d\u770b\u5417\uff1f<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-351\" title=\"20203291-3415208\" src=\"https:\/\/saveweb.othing.xyz\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/06\/20203291-3415208.jpg\" alt=\"20203291-3415208\" width=\"1242\" height=\"1242\" srcset=\"https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/06\/20203291-3415208.jpg 1242w, https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/06\/20203291-3415208-300x300.jpg 300w, https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/06\/20203291-3415208-1024x1024.jpg 1024w, https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/06\/20203291-3415208-150x150.jpg 150w, https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/06\/20203291-3415208-768x768.jpg 768w\" sizes=\"auto, (max-width: 1242px) 100vw, 1242px\" \/><\/p>\n<p>\u6070\u9022\u524d\u51e0\u5929\u722c\u866b\u722c\u4e86\u5fc5\u5e94\u6bcf\u65e5\u4e00\u56fe\u6210\u5c31\u611f\u6ee1\u6ee1\uff0c\u4eca\u65e5\u5c31\u51c6\u5907\u90a3\u8fd9\u4e2a\u56fd\u5bb6\u5730\u7406\u518d\u6b21\u8bd5\u8bd5\u624b\u3002<\/p>\n<h2>\u8fc7\u7a0b<\/h2>\n<p>\u9996\u5148\u5c31\u5728\u7f51\u4e0a\u627e\u56fd\u5bb6\u5730\u7406\u6bcf\u65e5\u5bb6\u7cbe\u9009\u7684web\u7aef\uff0c http:\/\/dili.bdatu.com\/index.php\/Share\/adindex\/id\/208 \uff0c\u53ea\u53d1\u73b0\u4e86\u8fd9\u4e2a\uff0c\u901a\u8fc7\u6539\u53d8id\u53d1\u73b0\u6709\u4e9bid\u597d\u50cf\u8fd8\u6ca1\u6709\u5185\u5bb9\uff0c\u4f46\u662f\u4e0d\u4f1a\u62a5\u9519\uff0c\u521a\u5f00\u59cb\u6211\u4ee5\u4e3a\u662f\u7279\u5b9a\u7684\u6570\u5b57\u6bb5id\u624d\u6709\u5185\u5bb9\u7684\uff0c\u7136\u540e\u5c31\u6ca1\u6709\u592a\u5728\u610f\u8fd9\u4e2a\u3002\u8fd9\u6837\u8003\u8651\u7684\u8bdd\u6211\u5c31\u9047\u5230\u4e86\u4e00\u4e2a\u95ee\u9898\uff0c\u6211\u624b\u52a8\u6d4b\u8bd5ID\u53d1\u73b0202\u6709\u5185\u5bb9\uff0c203,204\u2026\u2026\u90fd\u662f\u6709\u7684\uff0c\u4e8e\u662f\u6211\u5c31\u6311\u4e86\u4e00\u6bb5\u6d4b\u8bd5\u4e00\u4e0b\u90fd\u662f\u6709\u7684\uff0c\u4e8e\u662f\u6211\u5927\u80c6\u7684\u731c\u6d4b\u4ed6\u5728202-1000\u8fd9\u4e2a\u5b57\u6bb5\u90fd\u662f\u6709\u5185\u5bb9\u7684\uff0c\u76f4\u63a5\u5199\u4ee3\u7801\u722c\u53d6\u3002\u4e8e\u662f\u95ee\u9898\u5c31\u6765\u4e86\uff0c202-206\u90fd\u662f\u53ef\u4ee5\u7684\uff0c207\u5c31\u62a5\u9519\u4e86\uff0c\u7136\u540e202-206\u7684\u5185\u5bb9\u4e5f\u6ca1\u6709\u4fdd\u5b58\u4e0b\u6765\u3002<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-350\" title=\"20203292-3887965\" src=\"https:\/\/saveweb.othing.xyz\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/06\/20203292-3887965.jpg\" alt=\"20203292-3887965\" width=\"1920\" height=\"1030\" srcset=\"https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/06\/20203292-3887965.jpg 1920w, https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/06\/20203292-3887965-300x161.jpg 300w, https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/06\/20203292-3887965-1024x549.jpg 1024w, https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/06\/20203292-3887965-768x412.jpg 768w, https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/06\/20203292-3887965-1536x824.jpg 1536w\" sizes=\"auto, (max-width: 1920px) 100vw, 1920px\" \/><\/p>\n<pre><code>Traceback (most recent call last):\r\n  File \"e:\/Python\/webpapapa\/NAGE.py\", line 11, in \r\n    src = items[1].get('src')\r\nIndexError: list index out of range<\/code><\/pre>\n<p>\u8fd9\u4e2a\u65f6\u5019\u6211\u624d\u660e\u767d\u6211\u5728b\u7ad9\u770b\u89c6\u9891\u7684\u65f6\u5019\u8001\u5e08\u4e00\u76f4\u5f3a\u8c03\u4e86\u4e00\u4e2atry\u3001except\u5728\u722c\u866b\u4e2d\u7684\u91cd\u8981\u6027\uff0c\u4e8e\u662f\u6211\u5c31\u53bb\u52a0\u4e0a\u8fd9\u4e2a\u4ee3\u7801\uff0c\u4e00\u5f00\u59cb\u6211\u4ee5\u4e3a\u662frequests.get\u51fa\u9519\u4e86\uff0c\u4e8e\u662f\u6211\u52a0\u4e0a\u4e86<\/p>\n<pre><code>try:\r\n    return r.content\r\nexcept ConnectionError:\r\n    print('\u8fde\u63a5\u5931\u8d25')<\/code><\/pre>\n<p>\u4f46\u662f\u8fd8\u662f\u4f1a\u51fa\u9519\uff0c\u8fdb\u4e00\u6b65\u7814\u7a76\u53d1\u73b0\uff0c\u8fd9\u4e2a\u7f51\u9875get\u7684\u6ca1\u6709\u95ee\u9898\uff0c\u95ee\u9898\u5728\u4e8e\u83b7\u53d6img\u6807\u7b7e\u7684src\u7684\u503c<\/p>\n<pre><code>items = soup.find_all(\"img\")\r\nsrc = items[i].get('src')<\/code><\/pre>\n<p>\u6211\u4eec\u5728\u7f8e\u4e3d\u6c64\u91cc\u71ac\u6210\u7684\u6c64\u4e2d\u6ca1\u6709img\u6807\u7b7e\u6240\u4ee5items\u83b7\u53d6\u7684\u5b57\u5178\u5c31\u4e3a\u7a7a\uff0c\u8fd9\u4e2a\u65f6\u5019get(&#8216;src&#8217;)\u5c31\u4f1a\u62a5\u9519\uff0c\u53d1\u73b0\u4e86\u95ee\u9898\u6240\u5728\uff0c\u5c31\u5f88\u597d\u89e3\u51b3\u4e86\uff0c\u540e\u9762\u5c31\u987a\u5229\u722c\u53d6\u4e86\u4e00\u4e9b\u56fe\u7247\u3002<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-352\" title=\"20203293-2463972\" src=\"https:\/\/saveweb.othing.xyz\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/06\/20203293-2463972.jpg\" alt=\"20203293-2463972\" width=\"480\" height=\"200\" \/><\/p>\n<p>\u8fd9\u6b21\u4f3c\u4e4e\u633a\u6210\u529f\u7684\uff0c\u4f46\u662f\u7531\u4e8e\u6211\u4e00\u53e3\u6c14\u8bbe\u7f6e\u4e86202-1000\u8fd9\u4e48\u591a\u4e2a\u8fde\u63a5\uff0c\u4e8e\u662f\u722c\u5230\u7b2c983\u9875\u7684\u65f6\u5019\u51fa\u73b0\u4e86\u4e00\u4e2a\u95ee\u9898\uff0c\u4e00\u4e2a\u6ca1\u6709\u9884\u60f3\u5230\u7684\u95ee\u9898\u3002<\/p>\n<pre><code>Traceback (most recent call last):\r\n  File \"C:Users\u543eAppDataLocalProgramsPythonPython37libsite-packagesrequestsmodels.py\", line 750, in generate\r\n    for chunk in self.raw.stream(chunk_size, decode_content=True):\r\n  File \"C:Users\u543eAppDataLocalProgramsPythonPython37libsite-packagesurllib3response.py\", line 494, in stream\r\n    data = self.read(amt=amt, decode_content=decode_content)\r\n  File \"C:Users\u543eAppDataLocalProgramsPythonPython37libsite-packagesurllib3response.py\", line 459, in read\r\n    raise IncompleteRead(self._fp_bytes_read, self.length_remaining)\r\n  File \"C:Users\u543eAppDataLocalProgramsPythonPython37libcontextlib.py\", line 130, in __exit__\r\n    self.gen.throw(type, value, traceback)\r\n    raise ProtocolError('Connection broken: %r' % e, e)\r\nurllib3.exceptions.ProtocolError: (\"Connection broken: ConnectionResetError(10054, '\u8fdc\u7a0b\u4e3b\u673a\u5f3a\u8feb\u5173\u95ed\u4e86\u4e00\u4e2a\u73b0\u6709\u7684\u8fde\u63a5\u3002', None, 10054, None)\", ConnectionResetError(10054, '\u8fdc\u7a0b\u4e3b\u673a\u5f3a\u8feb\u5173\u95ed\u4e86\u4e00\u4e2a\u73b0\u6709\u7684\u8fde\u63a5\u3002', None, 10054, None))<\/code><\/pre>\n<p>\u767e\u5ea6\u4e86\u4e00\u4e0b\u53d1\u73b0\u662f\u56e0\u4e3a\u4f7f\u7528urlopen\u65b9\u6cd5\u592a\u8fc7\u9891\u7e41\uff0c\u5f15\u8d77\u8fdc\u7a0b\u4e3b\u673a\u7684\u6000\u7591\uff0c\u88ab\u7f51\u7ad9\u8ba4\u5b9a\u4e3a\u662f\u653b\u51fb\u884c\u4e3a\u3002\u5bfc\u81f4urlopen()\u540e\uff0crequest.read()\u4e00\u76f4\u5361\u6b7b\u5728\u90a3\u91cc\u3002\u6700\u540e\u629b\u51fa10054\u5f02\u5e38\u3002 \u89e3\u51b3\u65b9\u6cd5\u662f\u8981time.sleep(t)\uff0c\u6bcf\u6b21\u8bbf\u95ee\u540e\u4f11\u606f\u4e00\u4f1a\u518d\u722c\uff0c\u8ba9\u8fd9\u4e2a\u884c\u4e3a\u66f4\u50cf\u4eba\u4e00\u70b9\u3002<\/p>\n<p>\u6709\u4e00\u70b9\u70b9\u660e\u767d\u4e86\uff0c\u4f46\u662f\u6211\u5e76\u6ca1\u6709\u89e3\u51b3\u4ed6\uff0c\u4efb\u7531\u5b83\u53bb\u4e86\uff0c\u4ee5\u540e\u5199\u722c\u866b\u7684\u65f6\u5019\u6ce8\u610f\u4e00\u4e0b\u5c31\u597d\u3002<\/p>\n<p>\u8fd9\u6b21\u722c\u866b\u603b\u5171\u722c\u53d6\u4e86740\u5f20\uff0c\u4e5f\u4e0d\u77e5\u9053\u8d28\u91cf\u5982\u4f55\uff0c\u53ea\u662f\u53bb\u7ec3\u4e60\u722c\u866b\u3002<\/p>\n<p>\u53ef\u5728\u6211\u7684\u7f51\u76d8\u4e2d\u83b7\u53d6\u6240\u6709\u56fe\u7247\uff1ahttp:\/\/pan.wubuster.com\/?\/national_geographic.zip<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-335\" title=\"20203294-2392893\" src=\"https:\/\/saveweb.othing.xyz\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/04\/20203294-2392893.jpg\" alt=\"20203294-2392893\" width=\"1730\" height=\"1031\" srcset=\"https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/04\/20203294-2392893.jpg 1730w, https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/04\/20203294-2392893-300x179.jpg 300w, https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/04\/20203294-2392893-1024x610.jpg 1024w, https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/04\/20203294-2392893-768x458.jpg 768w, https:\/\/blog.save-web.org\/iamwu555\/wp-content\/uploads\/sites\/5\/2021\/04\/20203294-2392893-1536x915.jpg 1536w\" sizes=\"auto, (max-width: 1730px) 100vw, 1730px\" \/><\/p>\n<h2>\u4ee3\u7801<\/h2>\n<pre><code>import requests\r\nfrom bs4 import BeautifulSoup\r\nimport os\r\n\r\ndef get_html(url):\r\n    headers = {'User-agent': 'Mozilla\/5.0 (Windows NT 6.3; WOW64) AppleWebKit\/537.36 (KHTML, like Gecko) Chrome\/55.0.2883.87 UBrowser\/6.2.3964.2 Safari\/537.36'}\r\n    r = requests.get(url, headers=headers)\r\n    try:\r\n        return r.content\r\n    except ConnectionError:\r\n        print('\u8fde\u63a5\u5931\u8d25')\r\n    \r\n\r\ndef download(text):\r\n    headers = {'User-agent': 'Mozilla\/5.0 (Windows NT 6.3; WOW64) AppleWebKit\/537.36 (KHTML, like Gecko) Chrome\/55.0.2883.87 UBrowser\/6.2.3964.2 Safari\/537.36'}\r\n    soup = BeautifulSoup(text, 'html.parser')\r\n    items = soup.find_all(\"img\")\r\n    for i in range(1,11):\r\n        if i:\r\n            try:\r\n                src = items[i].get('src')\r\n                img = requests.get(src,headers=headers)\r\n                root = \"E:\/\/national_geographic\/\/\"\r\n                if os.path.exists(root) == False:\r\n                    os.makedirs(root)\r\n                path = root + src.split('\/')[-1]\r\n                with open(path, 'wb') as f: #wb\u662f\u4ee5\u4e8c\u8fdb\u5236\u5199\u5165\r\n                    f.write(img.content) #r.content\u5c31\u662f\u4e8c\u8fdb\u5236\u67e5\u770b\r\n                    f.close()\r\n            except:\r\n                print(\"src\u83b7\u53d6\u5931\u8d25\")        \r\ndef main():\r\n    for i in range(202,1000):\r\n        url = 'http:\/\/dili.bdatu.com\/index.php\/Share\/adindex\/id\/{}'.format(i)\r\n        text = get_html(url)\r\n        download(text)\r\n        print(\"\u7b2c{}\u9875\u4e0b\u8f7d\u5b8c\u6210\".format(i))\r\n\r\nif __name__ == \"__main__\":\r\n    main()<\/code><\/pre>\n","protected":false},"excerpt":{"rendered":"<p>\u5728\u5237Jellow\u7684\u65f6\u5019\u770b\u89c1\u4e00\u4e2a\u4eba\u8bf4\u7b2c\u4e00\u6b21\u4ed8\u8d39\u5e94\u7528\uff0c\u56e0\u4e3a\u56fd\u5bb6\u5730\u7406\u6bcf\u65e5\u7cbe\u9009\u91cc\u7684\u56fe\u7247\u592a\u597d\u770b\uff0c\u6211\u5c31\u597d\u5947\u6709\u591a\u597d\u770b\u5417\uff1f \u6070 &hellip; <a href=\"https:\/\/blog.save-web.org\/iamwu555\/archives\/8\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;\u56fd\u5bb6\u5730\u7406\u6bcf\u65e5\u7cbe\u9009\u722c\u53d6&#8221;<\/span><\/a><\/p>\n","protected":false},"author":5,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-8","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/blog.save-web.org\/iamwu555\/wp-json\/wp\/v2\/posts\/8","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.save-web.org\/iamwu555\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.save-web.org\/iamwu555\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.save-web.org\/iamwu555\/wp-json\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.save-web.org\/iamwu555\/wp-json\/wp\/v2\/comments?post=8"}],"version-history":[{"count":6,"href":"https:\/\/blog.save-web.org\/iamwu555\/wp-json\/wp\/v2\/posts\/8\/revisions"}],"predecessor-version":[{"id":421,"href":"https:\/\/blog.save-web.org\/iamwu555\/wp-json\/wp\/v2\/posts\/8\/revisions\/421"}],"wp:attachment":[{"href":"https:\/\/blog.save-web.org\/iamwu555\/wp-json\/wp\/v2\/media?parent=8"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.save-web.org\/iamwu555\/wp-json\/wp\/v2\/categories?post=8"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.save-web.org\/iamwu555\/wp-json\/wp\/v2\/tags?post=8"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}