Consider the following set of training data.classX11X212N33+ + + + 1301 1000101-111. Plot these eight training points in a two-dimensional space (with x1 and x2). Are the classes {+, −} linearly separable? Why? Construct the weight vector of the maximum margin hyperplane by inspection and identify the support vectors.2. If you remove one of the support vectors, does the size of the optimal margin decrease, stay the same, or increase? Justify your answer. Your answer to the previous question still hold for any dataset in a 2-dimentioanl space? Provide an example if it is not true, or give a short proof if it is true. What if we have datasets in a space with more than two dimensions, do you have the same answer? Please justify.