Clean dataset

The cleaned dataset will have the following structure:

Index	Name	Type	Range
0	Grade	int	[1-5]
1	Sex	enum	[0-1]
2	GPA	float	[1-5]
3	Math	int	[1-5]
4	Slovak	int	[1-5]
5	English	int	[1-5]
6	SES	enum	[0-2]
7	Occupation	enum	[0-5]
8	Living	enum	[0-4]
9	Commute	enum	[0-4]
10	Sleep	enum	[0-2]
11	Absence	int	[0-∞]

It will be saved in a .npy file (numpy format)

0 - female
1 - male

0 - lower class
1 - middle class
2 - upper class

0 - work hours / week >= 10
1 - work hours / week < 10
2 - sport
3 - music
4 - other
5 - none

0 - with family
1 - with family member
2 - alone / roomates
3 - dorms
4 - other

0 - dorms
1 - <= 15m
2 - <= 30m
3 - <= 1h
4 - > 1h

0 - short sleepers
1 - medium sleepers
2 - long sleepers