Open In App

How to read a numerical data or file in Python with numpy?

Prerequisites: Numpy 

NumPy is a general-purpose array-processing package. It provides a high-performance multidimensional array object and tools for working with these arrays. This article depicts how numeric data can be read from a file using Numpy.



Numerical data can be present in different formats of file :

There are multiple ways of storing data in files and the above ones are some of the most used formats for storing numerical data. To achieve our required functionality numpy’s loadtxt() function will be used.



Syntax: numpy.loadtxt(fname, dtype=’float’, comments=’#’, delimiter=None, converters=None, skiprows=0, usecols=None, unpack=False, ndmin=0)

Parameters:
fname : File, filename, or generator to read. If the filename extension is .gz or .bz2, the file is first decompressed. Note that generators should return byte strings for Python 3k.
dtype : Data-type of the resulting array; default: float. If this is a structured data-type, the resulting array will be 1-dimensional, and each row will be interpreted as an element of the array.
delimiter : The string used to separate values. By default, this is any whitespace.
converters : A dictionary mapping column number to a function that will convert that column to a float. E.g., if column 0 is a date string: converters = {0: datestr2num}. Default: None.
skiprows : Skip the first skiprows lines; default: 0.

Returns: ndarray

Approach

Given below are some implementation for various file formats:

Link to download data files used : 

Example 1: Reading numerical data from text file




# Importing libraries that will be used
import numpy as np
 
# Setting name of the file that the data is to be extracted from in python
filename = 'gfg_example1.txt'
 
# Loading file data into numpy array and storing it in variable called data_collected
data_collected = np.loadtxt(filename)
 
# Printing data stored
print(data_collected)
 
 
# Type of data
print(
    f'Stored in : {type(data_collected)} and data type is : {data_collected.dtype}')

Output : 

Output of Example 1

Example 2: Reading numerical data from CSV file.




# Importing libraries that will be used
import numpy as np
 
# Setting name of the file that the data is to be extracted from in python
# This is a comma separated values file
filename = 'gfg_example2.csv'
 
# Loading file data into numpy array and storing it in variable.
# We use a delimiter that basically tells the code that at every ',' we encounter,
# we need to treat it as a new data point.
# The data type of the variables is set to be int using dtype parameter.
data_collected = np.loadtxt(filename, delimiter=',', dtype=int)
 
# Printing data stored
print(data_collected)
 
 
# Type of data
print(
    f'Stored in : {type(data_collected)} and data type is : {data_collected.dtype}')

Output : 

Output of Example 2

Example 3: Reading from tsv file




# Importing libraries that will be used
import numpy as np
 
# Setting name of the file that the data is to be extracted from in python
# This
filename = 'gfg_example3.tsv'
 
# Loading file data into numpy array and storing it in variable called data_collected
# We use a delimiter that basically tells the code that at every ',' we encounter,
# we need to treat it as a new data point.
data_collected = np.loadtxt(filename, delimiter='\t')
 
# Printing data stored
print(data_collected)
 
 
# Type of data
print(
    f'Stored in : {type(data_collected)} and data type is : {data_collected.dtype}')

Output : 

Output of Example 3

Example 4: Select only particular rows and skip some rows




# Importing libraries that will be used
import numpy as np
 
# Setting name of the file that the data is to be extracted from in python
filename = 'gfg_example4.csv'
 
# Loading file data into numpy array and storing it in variable called data_collected
data_collected = np.loadtxt(
    filename, skiprows=1, usecols=[0, 1], delimiter=',')
 
# Printing data stored
print(data_collected)
 
 
# Type of data
print(
    f'Stored in : {type(data_collected)} and data type is : {data_collected.dtype}')

Output : 

Output of Example 4


Article Tags :