python 的numpy库中的mean()函数用法介绍

更新时间：2020年03月03日 16:08:04 作者：饕餮争锋

这篇文章主要介绍了python 的numpy库中的mean()函数用法介绍，具有很好对参考价值，希望对大家有所帮助。一起跟随小编过来看看吧

1. mean() 函数定义：

numpy.mean(a, axis=None, dtype=None, out=None, keepdims=<class numpy._globals._NoValue at 0x40b6a26c>)[source]
Compute the arithmetic mean along the specified axis.

Returns the average of the array elements. The average is taken over the flattened array by default, otherwise over the specified axis. float64intermediate and return values are used for integer inputs.

Parameters:

Parameters:	a : array_like Array containing numbers whose mean is desired. If a is not an array, a conversion is attempted. axis : None or int or tuple of ints, optional Axis or axes along which the means are computed. The default is to compute the mean of the flattened array. New in version 1.7.0. If this is a tuple of ints, a mean is performed over multiple axes, instead of a single axis or all the axes as before. dtype : data-type, optional Type to use in computing the mean. For integer inputs, the default is `float64`; for floating point inputs, it is the same as the input dtype. out : ndarray, optional Alternate output array in which to place the result. The default is `None`; if provided, it must have the same shape as the expected output, but the type will be cast if necessary. See `doc.ufuncs` for details. keepdims : bool, optional If this is set to True, the axes which are reduced are left in the result as dimensions with size one. With this option, the result will broadcast correctly against the input array. If the default value is passed, then keepdims will not be passed through to the mean method of sub-classes of ndarray, however any non-default value will be. If the sub-classes sum method does not implement keepdims any exceptions will be raised.
Returns:	m : ndarray, see dtype parameter above If out=None, returns a new array containing the mean values, otherwise a reference to the output array is returned.

a : array_like

Array containing numbers whose mean is desired. If a is not an array, a conversion is attempted.

axis : None or int or tuple of ints, optional

Axis or axes along which the means are computed. The default is to compute the mean of the flattened array.

New in version 1.7.0.

If this is a tuple of ints, a mean is performed over multiple axes, instead of a single axis or all the axes as before.

dtype : data-type, optional

Type to use in computing the mean. For integer inputs, the default is float64; for floating point inputs, it is the same as the input dtype.

out : ndarray, optional

Alternate output array in which to place the result. The default is None; if provided, it must have the same shape as the expected output, but the type will be cast if necessary. See doc.ufuncs for details.

keepdims : bool, optional

If this is set to True, the axes which are reduced are left in the result as dimensions with size one. With this option, the result will broadcast correctly against the input array.

If the default value is passed, then keepdims will not be passed through to the mean method of sub-classes of ndarray, however any non-default value will be. If the sub-classes sum method does not implement keepdims any exceptions will be raised.

Returns:

m : ndarray, see dtype parameter above

If out=None, returns a new array containing the mean values, otherwise a reference to the output array is returned.

2 mean()函数功能：求取均值

经常操作的参数为axis，以m * n矩阵举例：

axis 不设置值，对 m*n 个数求均值，返回一个实数

axis = 0：压缩行，对各列求均值，返回 1* n 矩阵

axis =1 ：压缩列，对各行求均值，返回 m *1 矩阵

举例：

>>> import numpy as np

>>> num1 = np.array([[1,2,3],[2,3,4],[3,4,5],[4,5,6]])
>>> now2 = np.mat(num1)
>>> now2
matrix([[1, 2, 3],
  [2, 3, 4],
  [3, 4, 5],
  [4, 5, 6]])


>>> np.mean(now2) # 对所有元素求均值
3.5


>>> np.mean(now2,0) # 压缩行，对各列求均值
matrix([[ 2.5, 3.5, 4.5]])


>>> np.mean(now2,1) # 压缩列，对各行求均值
matrix([[ 2.],
  [ 3.],
  [ 4.],
  [ 5.]])

补充拓展：numpy的np.nanmax和np.max区别（坑）

numpy的np.nanmax和np.array([1,2,3,np.nan]).max()的区别（坑）

numpy中numpy.nanmax的官方文档

原理

在计算dataframe最大值时，最先用到的一定是Series对象的max()方法()，最终结果是4。

s1 = pd.Series([1,2,3,4,np.nan])
s1_max = s1.max()

但是笔者由于数据量巨大，列数较多，于是为了加快计算速度，采用numpy进行最大值的计算，但正如以下代码，最终结果得到的是nan，而非4。发现，采用这种方式计算最大值，nan也会包含进去，并最终结果为nan。

s1 = pd.Series([1,2,3,4,np.nan])
s1_max = s1.values.max()
>>>nan

通过阅读numpy的文档发现，存在np.nanmax的函数，可以将np.nan排除进行最大值的计算，并得到想要的正确结果。

当然不止是max，min 、std、mean 均会存在列中含有np.nan时，s1.values.min /std/mean ()返回nan的情况。

速度区别

速度由快到慢依次:

s1 = pd.Series([1,2,3,4,5,np.nan])
#速度由快至慢
np.nanmax(s1.values) > np.nanmax(s1) > s1.max()

以上这篇python 的numpy库中的mean()函数用法介绍就是小编分享给大家的全部内容了，希望能给大家一个参考，也希望大家多多支持脚本之家。

您可能感兴趣的文章:

python中with语句结合上下文管理器操作详解
这篇文章主要给大家介绍了关于python中with语句结合上下文管理器操作的相关资料，文中通过示例代码介绍的非常详细，对大家学习或者使用python具有一定的参考学习价值，需要的朋友们下面来一起学习学习吧
2019-12-12
举例讲解Python中的死锁、可重入锁和互斥锁
这篇文章主要介绍了举例讲解Python中的死锁、可重入锁和互斥锁,尽管线程编程方面Python的GIL问题老生常谈...需要的朋友可以参考下
2015-11-11
python使用Matplotlib画饼图
这篇文章主要介绍了python使用Matplotlib画饼图，具有一定的参考价值，感兴趣的小伙伴们可以参考一下
2018-09-09
Python设计模式之代理模式实例详解
这篇文章主要介绍了Python设计模式之代理模式,结合实例形式较为详细的分析了代理模式的概念、原理及Python定义、使用代理模式相关操作技巧,需要的朋友可以参考下
2019-01-01
Pytorch使用Visdom进行数据可视化的示例代码
pytorch Visdom可视化,是一个灵活的工具,用于创建,组织和共享实时丰富数据的可视化,这个博客简要介绍一下在使用Pytorch进行数据可视化的一些内容,感兴趣的朋友可以参考下
2023-12-12
Python文件处理、os模块、glob模块
这篇文章介绍了Python处理文件的方法，文中通过示例代码介绍的非常详细。对大家的学习或工作具有一定的参考借鉴价值，需要的朋友可以参考下
2022-05-05
python百行代码自制电脑端网速悬浮窗的实现
这篇文章主要介绍了python百行代码自制电脑端网速悬浮窗的实现，文中通过示例代码介绍的非常详细，对大家的学习或者工作具有一定的参考学习价值，需要的朋友们下面随着小编来一起学习学习吧
2020-05-05
python计算圆周长、面积、球体体积并画出圆
这篇文章主要介绍了python计算圆周长、面积、球体体积并画出圆(python3+PyObject+Gtk实现界面联动),需要的朋友可以参考下
2014-04-04
Python sort 自定义函数排序问题
这篇文章主要介绍了Python sort 自定义函数排序问题,具有很好的参考价值,希望对大家有所帮助,如有错误或未考虑完全的地方,望不吝赐教
2023-09-09
Python实现抓取腾讯视频所有电影的示例代码
这篇文章主要为大家介绍了如何使用python实现抓取腾讯视频所有电影,本文通过实例代码给大家介绍的非常详细，具有一定的参考借鉴价值，需要的朋友可以参考下
2022-04-04

python 的numpy库中的mean()函数用法介绍

相关文章

最新评论

大家感兴趣的内容

最近更新的内容

常用在线小工具