Specimen,Num attachments,inc excutable,inc ZIP,inc PDF,inc DOC,Unknown Format,URL count,outside network,Email Size,Verified as Malware VS0001,1,Yes,No,No,No,No,1,Yes,76172,...

Student ID: 10460276


Specimen,Num attachments,inc excutable,inc ZIP,inc PDF,inc DOC,Unknown Format,URL count,outside network,Email Size,Verified as Malware
VS0001,1,Yes,No,No,No,No,1,Yes,76172,
VS0002,1,No,Yes,No,No,No,0,Yes,248404,
VS0003,1,No,No,No,No,Yes,2,Yes,2841,Yes
VS0004,2,Yes,No,No,Yes,No,0,Yes,132988,Yes
VS0005,1,No,No,No,Yes,No,1,Yes,117140,
VS0006,2,No,No,Yes,Yes,No,0,Yes,127923,
VS0007,0,No,No,No,No,No,0,Yes,1660,
VS0008,1,No,No,No,No,Yes,1,Yes,1542,Yes
VS0009,0,No,No,No,No,No,0,,4405,
VS0010,0,No,No,No,No,No,0,Yes,2397,
VS0011,3,Yes,No,Yes,No,Yes,0,Yes,90557,
VS0012,1,No,No,No,No,Yes,3,Yes,2257,Yes
VS0013,0,No,No,No,No,No,0,,3003,
VS0014,2,No,No,No,No,Yes,0,Yes,337252,
VS0015,4,No,No,Yes,No,Yes,0,Yes,281918,
VS0016,1,No,No,No,Yes,No,0,Yes,159965,
VS0017,12,No,No,No,Yes,Yes,0,Yes,1832141,
VS0018,1,No,No,No,No,Yes,1,Yes,547919,
VS0019,0,No,No,No,No,No,0,,3440,
VS0020,1,No,No,No,No,Yes,0,,214397,
VS0021,2,No,No,Yes,Yes,No,0,Yes,228191,
VS0022,1,No,Yes,No,No,No,3,Yes,31347,Yes
VS0023,7,No,No,No,No,Yes,0,Yes,86098,
VS0024,0,No,No,No,No,No,0,Yes,3021,
VS0025,3,No,No,No,No,Yes,0,Yes,243311,
VS0026,2,No,No,No,No,Yes,0,Yes,143480,
VS0027,0,No,No,No,No,No,0,Yes,3226,
VS0028,1,No,No,No,No,Yes,1,Yes,63547,
VS0029,0,No,No,No,No,No,0,Yes,3381,
VS0030,0,No,No,No,No,No,0,Yes,2965,
VS0031,1,No,No,No,No,Yes,3,Yes,1546,Yes
VS0032,0,No,No,No,No,No,0,Yes,3447,
VS0033,1,No,No,No,No,Yes,3,Yes,2505,Yes
VS0034,1,No,No,No,No,Yes,3,Yes,1047,Yes
VS0035,3,No,No,No,No,Yes,0,Yes,311384,
VS0036,4,No,No,No,Yes,Yes,0,,6534551,
VS0037,0,No,No,No,No,No,0,Yes,2746,
VS0038,1,No,No,Yes,No,No,0,Yes,120543,
VS0039,0,No,No,No,No,No,0,Yes,3051,
VS0040,1,No,No,No,No,Yes,3,Yes,1753,Yes
VS0041,0,No,No,No,No,No,0,Yes,4964,
VS0042,3,No,No,Yes,No,Yes,0,Yes,194948,
VS0043,0,No,No,No,No,No,0,Yes,2753,
VS0044,3,No,No,Yes,No,Yes,0,,129767,
VS0045,0,No,No,No,No,No,0,Yes,4452,
VS0046,0,No,No,No,No,No,0,Yes,1081,
VS0047,2,No,No,Yes,No,Yes,0,,65351,
VS0048,1,No,No,No,No,Yes,0,Yes,245290,
VS0049,1,No,No,Yes,No,No,0,Yes,348898,
VS0050,1,Yes,No,No,No,No,0,Yes,36045,Yes
VS0051,0,No,No,No,No,No,0,Yes,1741,
VS0052,1,No,No,No,No,Yes,0,,51171,
VS0053,0,No,No,No,No,No,0,,3580,
VS0054,3,No,No,Yes,No,Yes,0,Yes,102171,
VS0055,1,No,No,Yes,No,No,0,,101480,
VS0056,1,No,No,No,No,Yes,0,,61616,
VS0057,2,No,No,No,Yes,Yes,1,,16129,
VS0058,0,No,No,No,No,No,0,,4488,
VS0059,1,No,No,No,No,Yes,1,Yes,2314,Yes
VS0060,2,No,No,No,No,Yes,0,,237898,
VS0061,0,No,No,No,No,No,0,Yes,3606,
VS0062,1,No,No,No,Yes,No,0,Yes,19908,Yes
VS0063,2,Yes,No,No,No,Yes,0,Yes,4853,
VS0064,0,No,No,No,No,No,0,Yes,3808,
VS0065,1,No,No,Yes,No,No,0,,199577,
VS0066,0,No,No,No,No,No,0,Yes,3520,
VS0067,1,No,No,Yes,No,No,0,,293976,
VS0068,2,Yes,No,No,Yes,No,3,Yes,1948,Yes
VS0069,2,No,No,No,No,Yes,0,Yes,16289,
VS0070,3,Yes,No,Yes,Yes,No,0,Yes,64428,Yes
VS0071,1,No,No,Yes,No,No,0,Yes,175709,
VS0072,0,No,No,No,No,No,0,,3067,
VS0073,2,No,No,Yes,No,Yes,0,Yes,362159,
VS0074,1,No,No,No,No,Yes,1,Yes,9425,
VS0075,1,No,No,No,No,Yes,0,Yes,168632,
VS0076,0,No,No,No,No,No,0,Yes,2609,
VS0077,2,No,No,Yes,No,Yes,0,Yes,652046,
VS0078,1,No,No,Yes,No,No,0,Yes,68650,
VS0079,1,No,No,Yes,No,No,1,Yes,212416,
VS0080,2,No,No,No,No,Yes,0,,160514,
VS0081,0,No,No,No,No,No,0,Yes,3224,
VS0082,0,No,No,No,No,No,0,,2941,
VS0083,0,No,No,No,No,No,0,Yes,2843,
VS0084,0,No,No,No,No,No,0,Yes,4095,
VS0085,0,No,No,No,No,No,0,Yes,3047,
VS0086,1,No,No,No,No,Yes,0,Yes,138768,
VS0087,1,No,No,No,No,Yes,3,Yes,2114,Yes
VS0088,0,No,No,No,No,No,2,,2976,
VS0089,4,Yes,No,No,No,Yes,0,Yes,347282,
VS0090,1,No,Yes,No,No,No,0,Yes,19390,Yes
VS0091,2,No,No,Yes,Yes,No,3,Yes,14011,Yes
VS0092,4,No,No,No,Yes,Yes,0,Yes,4869737,
VS0093,1,No,Yes,No,No,No,0,Yes,26285,Yes
VS0094,0,No,No,No,No,No,0,,3821,
VS0095,2,Yes,No,No,Yes,No,0,Yes,18928,Yes
VS0096,1,No,No,No,No,Yes,0,Yes,332639,
VS0097,1,No,No,No,No,Yes,1,,309972,
VS0098,3,No,No,Yes,Yes,Yes,0,Yes,115102,
VS0099,0,No,No,No,No,No,0,Yes,3393,
VS0100,0,No,No,No,No,No,0,Yes,2149,
VS0101,7,No,No,Yes,No,Yes,0,,158759,
VS0102,2,No,No,Yes,Yes,No,0,Yes,145296,
VS0103,1,No,No,No,No,Yes,1,Yes,2590,Yes
VS0104,1,No,No,No,No,Yes,0,Yes,179314,
VS0105,1,No,No,No,No,Yes,0,,46784,
VS0106,1,No,No,No,No,Yes,0,Yes,4126274,
VS0107,2,No,No,No,No,Yes,0,Yes,7124268,
VS0108,0,No,No,No,No,No,0,Yes,3117,
VS0109,0,No,No,No,No,No,1,,3650,
VS0110,0,No,No,No,No,No,0,Yes,2142,
VS0111,3,No,No,Yes,No,Yes,0,Yes,314432,
VS0112,0,No,No,No,No,No,0,,1787,
VS0113,0,No,No,No,No,No,0,Yes,3718,
VS0114,0,No,No,No,No,No,0,Yes,2377,
VS0115,2,Yes,No,Yes,No,No,0,Yes,10689,Yes
VS0116,2,No,No,Yes,Yes,No,0,Yes,70026,Yes
VS0117,1,No,No,No,No,Yes,0,,2976160,
VS0118,3,No,No,Yes,Yes,Yes,0,Yes,207515,
VS0119,0,No,No,No,No,No,0,Yes,2452,
VS0120,1,No,No,No,No,Yes,1,Yes,779,Yes
VS0121,0,No,No,No,No,No,0,Yes,2898,
VS0122,0,No,No,No,No,No,0,Yes,4328,
VS0123,3,Yes,No,Yes,Yes,No,0,Yes,102250,Yes
VS0124,0,No,No,No,No,No,0,,2297,
VS0125,1,No,No,No,Yes,No,0,Yes,84493,
VS0126,0,No,No,No,No,No,0,Yes,2221,
VS0127,0,No,No,No,No,No,0,Yes,1606,
VS0128,2,Yes,No,No,No,Yes,0,Yes,323368,
VS0129,3,No,No,No,No,Yes,0,Yes,130786,
VS0130,1,No,No,No,No,Yes,3,Yes,2217,Yes
VS0131,3,No,No,Yes,Yes,Yes,0,,217428,
VS0132,0,No,No,No,No,No,0,Yes,2953,
VS0133,1,No,No,No,Yes,No,0,,88030,
VS0134,0,No,No,No,No,No,0,Yes,4151,
VS0135,4,No,No,Yes,Yes,Yes,0,Yes,207624,
VS0136,2,No,No,No,No,Yes,0,,339246,
VS0137,1,No,No,Yes,No,No,0,,100431,
VS0138,0,No,No,No,No,No,0,Yes,313,
VS0139,1,No,No,No,No,Yes,2,Yes,2298,Yes
VS0140,1,No,No,No,Yes,No,0,,175396,
VS0141,1,No,No,Yes,No,No,0,Yes,48801,
VS0142,3,No,No,No,No,Yes,0,Yes,341595,
VS0143,0,No,No,No,No,No,0,Yes,3110,
VS0144,1,No,No,Yes,No,No,4,Yes,300233,
VS0145,2,No,No,Yes,Yes,No,0,Yes,44485,
VS0146,2,No,No,No,No,Yes,1,Yes,305250,
VS0147,0,No,No,No,No,No,0,Yes,2953,
VS0148,1,No,No,No,Yes,No,3,Yes,37245,Yes
VS0149,4,No,No,No,No,Yes,0,Yes,252039,
VS0150,2,Yes,No,Yes,No,No,4,Yes,31866,Yes
VS0151,1,No,No,Yes,No,No,0,Yes,68701,
VS0152,2,No,No,Yes,No,Yes,0,Yes,245030,
VS0153,1,No,Yes,No,No,No,0,Yes,122747,
VS0154,3,No,No,Yes,No,Yes,1,,467391,
VS0155,2,Yes,No,No,Yes,No,0,Yes,62865,Yes
VS0156,0,No,No,No,No,No,0,,935,
VS0157,3,No,No,No,No,Yes,0,Yes,273500,
VS0158,1,No,No,No,Yes,No,0,Yes,172379,
VS0159,0,No,No,No,No,No,0,Yes,3001,
VS0160,0,No,No,No,No,No,0,Yes,2774,
VS0161,1,No,No,No,No,Yes,0,Yes,263618,
VS0162,0,No,No,No,No,No,0,Yes,3004,
VS0163,0,No,No,No,No,No,0,Yes,1921,
VS0164,1,No,No,No,Yes,No,0,,217514,
VS0165,4,No,No,Yes,No,Yes,0,Yes,248239,
VS0166,1,No,No,No,No,Yes,0,Yes,160174,
VS0167,0,No,No,No,No,No,0,Yes,2321,
VS0168,4,Yes,No,No,No,Yes,0,Yes,260519,
VS0169,0,No,No,No,No,No,0,Yes,3253,
VS0170,0,No,No,No,No,No,0,Yes,3943,
VS0171,0,No,No,No,No,No,0,Yes,3707,
VS0172,0,No,No,No,No,No,0,Yes,4060,
VS0173,7,No,No,No,No,Yes,0,,164982,
VS0174,2,No,No,Yes,No,Yes,3,,565610,
VS0175,0,No,No,No,No,No,0,Yes,3922,
VS0176,0,No,No,No,No,No,0,Yes,3357,
VS0177,3,No,No,No,Yes,Yes,0,Yes,326782,
VS0178,1,No,No,Yes,No,No,0,,197340,
VS0179,0,No,No,No,No,No,0,,3164,
VS0180,1,No,No,Yes,No,No,0,Yes
Answered Same Day: Apr 03, 2021


Subhanbasha answered on Apr 04 2021
Principal Component Analysis
Data preparation steps
The data consist mostly of categorical variables in Yes/No format. Some columns contain only "Yes" values, with the remaining cells left blank (null). So, we computed probability values and used them to fill the null values with Yes/No.
  Specimen Num.attachments inc.excutable inc.ZIP inc.PDF inc.DOC
1   VS0001               1           Yes      No      No      No
2   VS0002               1            No     Yes      No      No
3   VS0003               1            No      No      No      No
4   VS0004               2           Yes      No      No     Yes
5   VS0005               1            No      No      No     Yes
6   VS0006               2            No      No     Yes     Yes
  Unknown.Format URL.count outside.network Email.Size Verified.as.Malware
1             No         1             Yes      76172
2             No         0             Yes     248404
3            Yes         2             Yes       2841                 Yes
4             No         0             Yes     132988                 Yes
5             No         1             Yes     117140
6             No         0             Yes     127923
The output above shows that the data are qualitative, and principal component analysis generally does not accept qualitative input. So, we used dummy variables to transform the data into numerical variables.
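One possible way to do this dummy encoding in base R is sketched below. It is not the answer's actual code, which is not shown in the excerpt. The data frame `emails` is a hypothetical four-row subset built from the first rows of the data, and the names `is_yes_no` and `emails_num` are assumptions introduced for illustration.

```r
# Hypothetical miniature of the email data; column names follow the R output above.
emails <- data.frame(
  Specimen        = c("VS0001", "VS0002", "VS0003", "VS0004"),
  Num.attachments = c(1, 1, 1, 2),
  inc.excutable   = c("Yes", "No", "No", "Yes"),
  inc.ZIP         = c("No", "Yes", "No", "No"),
  Email.Size      = c(76172, 248404, 2841, 132988),
  stringsAsFactors = FALSE
)

# Identify the columns whose values are all "Yes" or "No".
is_yes_no <- sapply(emails, function(col) all(col %in% c("Yes", "No")))

# Recode those columns as 1/0 dummy variables (Yes = 1, No = 0).
emails_num <- emails
emails_num[is_yes_no] <- lapply(emails[is_yes_no],
                                function(col) as.integer(col == "Yes"))

# Drop the identifier so that only numeric columns remain.
emails_num$Specimen <- NULL
str(emails_num)
```

With every remaining column numeric, a data frame like `emails_num` is in a form that a PCA routine can accept.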
Next, we took a random sample using the sample() function in R and stored it as a separate dataset. We checked for missing elements using the sapply() function, split the data into two parts by separating out the sample data, and used the rbind() function to combine the data frames. Then convert...
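The sampling, missing-value check, split, and recombination described above might look roughly like the following base R sketch. The sample size, the seed, and the names `emails`, `sample_idx`, `sample_data`, `rest_data`, and `combined` are all assumptions, since the excerpt does not give them; the six rows are taken from the head of the data.

```r
set.seed(42)  # hypothetical seed, only so that the sketch is reproducible

# Hypothetical numeric subset using the first six rows of the data.
emails <- data.frame(
  Num.attachments = c(1, 1, 1, 2, 1, 2),
  URL.count       = c(1, 0, 2, 0, 1, 0),
  Email.Size      = c(76172, 248404, 2841, 132988, 117140, 127923)
)

# Draw a random sample of row indices and keep it as a separate dataset.
sample_idx  <- sample(nrow(emails), size = 4)  # size is an assumption
sample_data <- emails[sample_idx, ]

# Count the missing values in each column of the sample.
missing_counts <- sapply(sample_data, function(col) sum(is.na(col)))
print(missing_counts)

# The rows that were not sampled form the second part of the split.
rest_data <- emails[-sample_idx, ]

# Stack the two parts back into one data frame.
combined <- rbind(sample_data, rest_data)
```

Here `rbind()` works because both parts share the same columns; the combined frame has the same rows as `emails`, in a shuffled order.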