Problem 1. Consider a dataset with three columns of binary attributes A1, A2 and a binary label attribute Y. There are eight types of data point in total, and their corresponding proportions in the...


Problem 1. Consider a dataset with three binary columns: two attributes A1 and A2, and a label attribute Y. There are eight types of data point in total, and their corresponding proportions in the dataset are captured in the column P.

type | A1 A2 Y | P
[table garbled in extraction; the 0/1 entries for A1, A2 and Y are not reliably recoverable. Proportions in order of appearance: 8%, 29%, 2%, 18%, 16%, 2%, 1%, 24%]

(a) What is the Gini index of the dataset?
(b) What is the Gini index of the split on A1 and that on A2, respectively?
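Both parts use the Gini index 1 - sum_k p_k^2 over the label proportions p_k. For a split, that index is averaged over the branches, weighted by the share of data in each branch. The sketch below shows one way to carry out the arithmetic. The row values in ROWS are an assumption, not something the garbled table confirms: the eight types are taken to enumerate (A1, A2, Y) in binary order from 000 to 111, paired with the proportions in the order they appear. The function names are illustrative.

```python
from collections import defaultdict

# Each row is (A1, A2, Y, proportion).
# ASSUMPTION: types 1..8 enumerate (A1, A2, Y) as 000, 001, ..., 111 and the
# proportions are assigned in order of appearance. The extracted table does
# not confirm this mapping; replace ROWS if the original table differs.
ROWS = [
    (0, 0, 0, 0.08),
    (0, 0, 1, 0.29),
    (0, 1, 0, 0.02),
    (0, 1, 1, 0.18),
    (1, 0, 0, 0.16),
    (1, 0, 1, 0.02),
    (1, 1, 0, 0.01),
    (1, 1, 1, 0.24),
]


def gini(class_weights):
    """Gini index 1 - sum(p_k^2), computed from unnormalized class weights."""
    weights = list(class_weights)
    total = sum(weights)
    if total == 0:
        return 0.0
    return 1.0 - sum((w / total) ** 2 for w in weights)


def dataset_gini(rows):
    """Gini index of the label Y over the given rows."""
    by_label = defaultdict(float)
    for *_, y, p in rows:
        by_label[y] += p
    return gini(by_label.values())


def split_gini(rows, attr_index):
    """Weighted average of branch Gini indices after splitting on one attribute."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[attr_index]].append(row)
    total = sum(row[-1] for row in rows)
    return sum(
        sum(r[-1] for r in group) / total * dataset_gini(group)
        for group in groups.values()
    )


if __name__ == "__main__":
    print("dataset:", dataset_gini(ROWS))
    print("split on A1:", split_gini(ROWS, 0))
    print("split on A2:", split_gini(ROWS, 1))
```

Under this convention, the split with the lower weighted index is the purer one. The printed numbers depend entirely on the assumed ROWS.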


Jun 04, 2022
