python中用pandas库怎样创建和读写csv文件
Admin 2022-09-13 群英技术资讯 370 次浏览
import numpy as np import pandas as pd # -----create an initial numpy array----- # data = np.zeros((8,4)) # print(data.dtype) # print(type(data)) # print(data.shape) # -----from array to dataframe----- # df = pd.DataFrame(data) # print(type(df)) # print(df.shape) # print(df) # -----edit columns and index----- # df.columns = ['A', 'B', 'C', 'D'] df.index = range(data.shape[0]) df.info() # -----save dataframe as csv----- # csv_save_path='./data_.csv' df.to_csv(csv_save_path, sep=',', index=False, header=True) # -----check----- # df = pd.read_csv(csv_save_path) print('-' * 25) print(df)
输出如下:
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 8 entries, 0 to 7
Data columns (total 4 columns):
A 8 non-null float64
B 8 non-null float64
C 8 non-null float64
D 8 non-null float64
dtypes: float64(4)
memory usage: 336.0 bytes
-------------------------
A B C D
0 0.0 0.0 0.0 0.0
1 0.0 0.0 0.0 0.0
2 0.0 0.0 0.0 0.0
3 0.0 0.0 0.0 0.0
4 0.0 0.0 0.0 0.0
5 0.0 0.0 0.0 0.0
6 0.0 0.0 0.0 0.0
7 0.0 0.0 0.0 0.0
import pandas as pd import numpy as np csv_path = './data_.csv' # -----saved as dataframe----- # data = pd.read_csv(csv_path) # ---if index is given in csv file, you can use next line of code to replace the previous one--- # data = pd.read_csv(csv_path, index_col=0) print(type(data)) print(data) print(data.shape) # -----saved as array----- # data_ = np.array(data) # data_ = data.values print(type(data_)) print(data_) print(data_.shape)
输出如下:
<class 'pandas.core.frame.DataFrame'>
A B C D
0 0.0 0.0 0.0 0.0
1 0.0 0.0 0.0 0.0
2 0.0 0.0 0.0 0.0
3 0.0 0.0 0.0 0.0
4 0.0 0.0 0.0 0.0
5 0.0 0.0 0.0 0.0
6 0.0 0.0 0.0 0.0
7 0.0 0.0 0.0 0.0
(8, 4)
<class 'numpy.ndarray'>
[[0. 0. 0. 0.]
[0. 0. 0. 0.]
[0. 0. 0. 0.]
[0. 0. 0. 0.]
[0. 0. 0. 0.]
[0. 0. 0. 0.]
[0. 0. 0. 0.]
[0. 0. 0. 0.]]
(8, 4)
import pandas as pd import numpy as np csv_path = './data_.csv' df = pd.read_csv(csv_path) # -----edit columns and index----- # df.columns = ['X1', 'X2', 'X3', 'Y'] df.index = range(df.shape[0]) # df.index = [i+1 for i in range(df.shape[0])] # -----columns operations----- # Y = df['Y'] df['X4'] = [4 for i in range(df.shape[0])] # add df['X5'] = [5 for i in range(df.shape[0])] # print(df) df.drop(columns='Y', inplace=True) # delete # print(df) df['X1'] = [i+1 for i in range(df.shape[0])] # correct --(1) # df.iloc[:df.shape[0], 0] = [i+1 for i in range(df.shape[0])] # correct --(2) # print(df) df['Y'] = Y_temp # print(df) # -----rows operations----- # df.loc[df.shape[0]] = [i+2 for i in range(6)] # add # print(df) df.drop(index=4, inplace=True) # delete # print(df) df.loc[0] = [i+1 for i in range(df.shape[1])] # correct # print(df) # -----edit index again after rows operations!!!----- # df.index = range(df.shape[0]) # -----save dataframe as csv----- # csv_save_path='./data_copy.csv' df.to_csv(csv_save_path, sep=',', index=False, header=True) print(df)
输出如下:
X1 X2 X3 X4 X5 Y
0 1.0 2.0 3.0 4 5 6.0
1 2.0 0.0 0.0 4 5 0.0
2 3.0 0.0 0.0 4 5 0.0
3 4.0 0.0 0.0 4 5 0.0
4 6.0 0.0 0.0 4 5 0.0
5 7.0 0.0 0.0 4 5 0.0
6 8.0 0.0 0.0 4 5 0.0
7 2.0 3.0 4.0 5 6 7.0
免责声明:本站发布的内容(图片、视频和文字)以原创、转载和分享为主,文章观点不代表本网站立场,如果涉及侵权请联系站长邮箱:mmqy2019@163.com进行举报,并提供相关证据,查实之后,将立刻删除涉嫌侵权内容。
猜你喜欢
元组是Python中一种重要的内置数据类型。与列表一样,我们经常使用元组将多个对象保存为相应的数据容器。本文为大家总结了元组的三个不常用特性,感兴趣的小伙伴可以了解一下
这篇文章主要介绍了Python 统计数据集标签的类别及数目操作,具有很好的参考价值,希望对大家有所帮助。如有错误或未考虑完全的地方,望不吝赐教
这篇文章主要介绍了python爬虫抓取时常见的小问题总结,整理了部分新手在爬虫过程中遇到的问题,希望可以给大家提供一点问题解决的思路和参考,需要的小伙伴可以参考下面文章内容
对于Python语言来说,比较传统的数据可视化模块是Matplotlib,但它存在不够美观、静态性、不易分享等缺点,限制了Python在数据可视化方面的发展。为了解决这个问题,新型的动态可视化开源模块Plotly应运而生。本文将为大家详细介绍Plotly的用法,需要的可以参考一下
本文主要介绍Python列表复制的内容,而Python中列表的复制有直接赋值、浅复制和深复制这几种。下面我们来分别了解一下几种情况的使用和区别是什么。
成为群英会员,开启智能安全云计算之旅
立即注册Copyright © QY Network Company Ltd. All Rights Reserved. 2003-2020 群英 版权所有
增值电信经营许可证 : B1.B2-20140078 粤ICP备09006778号 域名注册商资质 粤 D3.1-20240008