Downloading data from Kaggle with wget

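Step 1: Export your cookies to cookies.txt. Log in to kaggle.com (I used IE11), click Favorites, click the triangle to the right of "Add to favorites", choose "Import and export", choose "Export to a file", select "Cookies", and click Export.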
Step 2: wget -cb 'https://www.kaggle.com/account/login?ReturnUrl=%2Fc%2Fnoaa-fisheries-steller-sea-lion-population-count%2Fdownload%2FKaggleNOAASeaLions.7z' --post-data 'username=YOUR_EMAIL&password=YOUR_PASSWORD'
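However, the download was very slow with this method, and when it finally finished, extracting the archive failed with an error.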
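
Step 1 exports the browser cookies, but the wget command in step 2 posts the login form and never reads them. A possible variation, which I have not tested against Kaggle, is to hand the cookie file to wget with --load-cookies. This is only a sketch: it assumes the exported file is in the Netscape cookies.txt format that wget reads, and COOKIE_FILE and fetch_with_cookies are illustrative names that do not come from the steps above.

```shell
#!/bin/sh
# Sketch only: reuse the cookies exported in step 1 instead of posting credentials.
# COOKIE_FILE and fetch_with_cookies are hypothetical names chosen for this example.

# The file exported in step 1; adjust the name if yours differs.
COOKIE_FILE="cookies.txt"
URL="https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/download/KaggleNOAASeaLions.7z"

fetch_with_cookies() {
    # -c resumes a partially downloaded file;
    # --load-cookies reads cookies from a Netscape-format text file.
    wget -c --load-cookies "$COOKIE_FILE" "$1"
}
```

Calling fetch_with_cookies "$URL" would start the download in the foreground. Whether the server accepts the exported session is an assumption here.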

So I looked for a way to extract the file and found this:

https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/discussion/30930

Answer 1: If it helps anyone: the default ubuntu Archive Manager failed to unpack the file for me. Was only showing the list of folders and files in them. I could unpack the archive by installing p7zip-full (sudo apt-get install p7zip-full) and doing 7z x Kaggle-NOAA-SeaLions -pPassword.

I tried this, but it did not solve the problem: extraction still failed with a CRC check error.
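
A CRC error usually means the bytes on disk do not match the checksums stored in the archive, so the file is probably corrupted or incomplete. Before extracting, the archive can be checked with 7z t, which tests integrity without writing any files. The helper below is a minimal sketch; check_archive is a hypothetical name, and the password argument is optional.

```shell
#!/bin/sh
# Sketch only: test a .7z archive before trying to extract it.
# check_archive is a hypothetical helper name, not part of p7zip.

check_archive() {
    archive="$1"
    password="${2:-}"

    # Fail early if the file is missing, without invoking 7z at all.
    if [ ! -f "$archive" ]; then
        echo "archive not found: $archive" >&2
        return 1
    fi

    # "7z t" tests the archive; a non-zero exit status means the test failed.
    if [ -n "$password" ]; then
        7z t "$archive" "-p$password" >/dev/null
    else
        7z t "$archive" >/dev/null
    fi
}
```

For example, check_archive KaggleNOAASeaLions.7z Password && 7z x KaggleNOAASeaLions.7z -pPassword would extract only when the test passes. If the test fails, downloading the file again is likely the only fix.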

So I decided to download the data again, and found a faster way to do it with aria2c:

https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/discussion/32702

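# Install aria2
sudo apt install aria2
# -c resumes a partial download, -x 16 allows up to 16 connections to the server, -s 16 splits the download across 16 connections, --load-cookies reads the exported cookie file
aria2c -c -x 16 -s 16 --load-cookies cookies.txt -p https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/download/KaggleNOAASeaLions.7z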

Reference: http://blog.csdn.net/laozhaokun/article/details/49587463

posted on 2017-10-24 23:00  MissSimple
