新人求解: Python 写入 txt 时数据丢失的问题

代码如下： def download(href_urls): for url in href_urls: mod_titles = [] ses = requests.session() html = ses.get(url, headers = header(), verify = False) soup = BeautifulSoup(html.content, 'html.parser') title_list = soup.find(class_ = 'g-ctnBar').find_all('a') title1 = title_list[2].get_text() title2 = title_list[3].get_text() title3 = title_list[4].get_text() title4 = title_list[5].get_text() list_ = soup.find_all('div', class_ = 'detail-mod J_floor')[:-3] for txt in list_: txts = txt.get_text() download_run(title1, title2, title3, title4, txts)

def download_run(title1, title2, title3, title4, txts): path = 'C:/Users/Desktop/run/%s/%s/%s' %(title1, title2, title3) if not os.path.exists(path): os.makedirs(path) with open('C:/Users/Desktop/run/%s/%s/%s/%s.txt' %(title1, title2, title3, title4), 'w')as f: f.write(txts)

txts

title1

title2

title3

18 replies • 2020-01-15 12:03:16 +08:00

chaneyccy

Jan 14, 2020

排版有点乱，更新一下

def download(href_urls):

for url in href_urls:

mod_titles = []

ses = requests.session()

html = ses.get(url, headers = header(), verify = False)

soup = BeautifulSoup(html.content, 'html.parser')

title_list = soup.find(class_ = 'g-ctnBar').find_all('a')

title1 = title_list[2].get_text()

title2 = title_list[3].get_text()

title3 = title_list[4].get_text()

title4 = title_list[5].get_text()

list_ = soup.find_all('div', class_ = 'detail-mod J_floor')[:-3]

for txt in list_:

txts = txt.get_text()

download_run(title1, title2, title3, title4, txts)

def download_run(title1, title2, title3, title4, txts):

path = 'C:/Users/Desktop/run/%s/%s/%s' %(title1, title2, title3)

if not os.path.exists(path):

os.makedirs(path)

with open('C:/Users/Desktop/run/%s/%s/%s/%s.txt' %(title1, title2, title3, title4), 'w')as f:

f.write(txts)

JCZ2MkKb5S8ZX9pq

Jan 14, 2020 via iPhone

```
你的代码
```

这样可以保留模式。回复时无效。详情可查 markdown 格式。

JCZ2MkKb5S8ZX9pq

Jan 14, 2020 via iPhone

格式

chaneyccy

Jan 14, 2020

@JCZ2MkKb5S8ZX9pq 好的，平时没有用 markdown 写内容的习惯~ 我去研究下

cxyfreedom

Jan 14, 2020

你遍历循环又每次写入，你循环完成后，txts 本来就只有一部分的数据，写入到文件中当然就只有最后一部分。

Vegetable

Jan 14, 2020

https://www.runoob.com/python/python-func-open.html
看表格

Vegetable

Jan 14, 2020

@cxyfreedom #5 代码我读了，是因为每次都 open(file,'w')写入的原因。

Wuuuu

Jan 14, 2020

感觉应该是 txts = txts+"\n"+txt.get_txt()?

Wuuuu

Jan 14, 2020

@Vegetable py 靠缩进……这样的不知道到底是
for txt in list_:

txts = txt.get_text()

download_run(title1, title2, title3, title4, txts)

还是
for txt in list_:

txts = txt.get_text()

download_run(title1, title2, title3, title4, txts)

但大概率是第二种写法吧。

Wuuuu

Jan 14, 2020

for txt in list_:

\t txts = txt.get_text()

\t download_run(title1, title2, title3, title4, txts)

for txt in list_:

\t txts = txt.get_text()

download_run(title1, title2, title3, title4, txts)