data:image/s3,"s3://crabby-images/72697/726973876b01628c86a420f8317caf74b7176ba0" alt="Create random dataset in python"
In addition to Faker and numpy, we’ll also need the handy pandas library. To begin, let’s make sure we have the necessary libraries installed. sales) based on a distribution or randomly select from a list. We will also use the Python numpy library since it will allow to create numeric fields (e.g. We’ll explore those most relevant for customer demos but the documentation details all the “providers” of fake data available in the library. It is useful to create realistic looking datasets and can generate all types of data.
data:image/s3,"s3://crabby-images/30cc1/30cc165d4f558bb4ec4de01bed39533a7139a5ec" alt="create random dataset in python create random dataset in python"
For this demo, we’ll upload the newly created datasets to SAP HANA Cloud as tables.įaker is a Python library that generates fake data for you. Once we create the datasets, we have a lot of flexibility with how we use them.
#CREATE RANDOM DATASET IN PYTHON HOW TO#
We can easily create such datasets in Python, and this blog will serve as a guide on how to use the Faker, numpy, and pandas libaries in Python to generate any datasets you need. Also, it would be nice to generate realistic looking PII data in case you needed to demonstrate data masking. Ideally, we would be able to create a dataset of any size easily and able to specify constraints on the data, such as matching data formats the customer may use or specifying the statistical distribution of the random data.
data:image/s3,"s3://crabby-images/40280/402806325ec4c817306489dcb5526c7c9a32042c" alt="create random dataset in python create random dataset in python"
We can create more engaging customer experiences if we had more realistic datasets that more closely resembled their own data. As Solution Advisors, we often need to create custom datasets to support customer opportunities.
data:image/s3,"s3://crabby-images/72697/726973876b01628c86a420f8317caf74b7176ba0" alt="Create random dataset in python"