wget从kaggle下载数据

step 1：导出cookie到cookie.txt。登录kaggle.com（我用的ie11），点击收藏夹,点击添加到收藏夹右边的三角形，选择导入导出，选择导出到文件，选择cokies，选择导出。
step 2： wget -cb https://www.kaggle.com/account/login?ReturnUrl=%2Fc%2Fnoaa-fisheries-steller-sea-lion-population-count%2Fdownload%2FKaggleNOAASeaLions.7z --post-data 'username=1030997649@qq.com&password=6393374'
但是这种方法下载的很慢，最后下载下来后，解压时还出错了。

于是，我找到了这种方法去解压文件：

https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/discussion/30930

答案1：If it helps anyone: the default ubuntu Archive Manager failed to unpack the file for me. Was only showing the list of folders and files in them. I could unpack the archive by installing p7zip-full (sudo apt-get install p7zip-full) and doing 7z x Kaggle-NOAA-SeaLions -pPassword.

我用这种方法，还是没有解决问题。有CRC校验错误。

于是，我决定重新下载数据，找到了一种用aria2c的快速下载方法：

https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/discussion/32702

sudo apt install aria2
aria2c -c -x 16 -s 16 --load-cookies cookies.txt -p https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/download/KaggleNOAASeaLions.7z

参考;http://blog.csdn.net/laozhaokun/article/details/49587463

posted on 2017-10-24 23:00 MissSimple 阅读(3036) 评论(0) 编辑收藏举报

刷新页面返回顶部

wget从kaggle下载数据

导航

公告