AllElectronics carries 1000 products, P1, . . . , P1000. Consider customers Ada, Bob, and Cathy such that Ada and Bob purchase three products in common, P1,P2, and P3. For the other 997 products, Ada and Bob independently purchase seven of them randomly. Cathy purchases 10 products, randomly selected from the 1000 products. In Euclidean distance, what is the probability that dist(Ada,Bob) > dist(Ada,Cathy)

Respuesta :

Answer:

The probability that dist(Ada,Bob)>(Ada,Cathy)  is very small as there is very large number of range to choose the product ==4.7*10^-9.

Step-by-step explanation:

Given:

Ada ,bob and cathy purchase electronics carries

Ada and bob commonly take 3 products and 7 independently.

And Cathy take 10  products on its own .

To Find:

probability that    dist(Ada,bob)>dis(Ada,Cathy)?

Solution:

Using Euclidean distance  is distance formula used in coordinate geometry simply known as Distance formula,

this problem is related to Euclidean Distance and Jaccard Similarity in Data mining.

1st calculate probability for x such that ,

[tex]3\leq x\leq 10[/tex]   as there are 3 common products.

P([tex]3\leq x\leq 10[/tex])

=[tex]\frac{7C(x-3)*990C(10-x)}{997C7}[/tex].............. where x=3,4,5....10. ..........(equation 1).

Now calculate for each term,we get

When

x=3,P(x=3)=0.95

x=4,P(x=4)=[tex]6.8*10^{-3}[/tex]

x=5,P(x=5)=[tex]4.1*10^{-5}[/tex]

x=6,P(x=6)=2.1*10^-7

x=7,P(x=7)=8.5*10^-10

x=8,P(x=8)=2.6*10^-12

x=9,P(x=9)=5.2*10^-15

x=10,P(x=10)=5.3*10^-18.

Now calculating the Euclidean distance,

It is distance between two points ,

So there are total of 2 points as Ada and bob

they have 3 products in common

and 7 independent products ,7  Ada and 7 bob

Total of 17 products .

1,2,3,4,5,6..........,16,17.

Consider each product number as distance between them ,

(Suppose 5 product and 1 product distance will be 4)

Similarly,

Suppose Ada is at 3rd number at the  3 product (as they have 3 product same.)

and bob  at product 17.

Hence when 3 products are similar distance between Ada and bob will be of 14.

Euclidean distance =[tex]\sqrt{14}[/tex].

Hence the Jaccard similarity =(Ada intersection Bob)/(Ada union bob)

=3/14

When 4 products are same means both will selected 6 and 6 independent product so that  the each one will get 10 products i.e. starting condition should remain same .

Hence now  bob will be at 16th term as it will take one more same product in between them

So no of same products will be 4,

Hence Ada will be at 4th term and bob will be at 16

So Euclidean distance =[tex]\sqrt{12}[/tex].

Similar For Next terms we can conclude as follows:

When

X=5 , dist(ada,bob)=[tex]\sqrt{10}[/tex],

X=6,dist(Ada,Bob)=[tex]\sqrt{8}[/tex]

X=7,dist(Ada,Bob)=[tex]\sqrt{6}[/tex]

X=8,dist(Ada,Bob)=[tex]\sqrt{4}[/tex]

X=9,dist(Ada,Bob)=[tex]\sqrt{2}[/tex]

X=10,dist(Ada,Bob)=[tex]\sqrt{0}[/tex].

Now for( Ada and cathy)

Here X ranges different but use same concept as above

Each term analog to the distance between them

Suppose 1st and 3rd term distance will be 2

First calculate

P([tex]1\leq x\leq 10[/tex]) as Cathy selects 10 products with no common between them.

P([tex]1\leq x\leq 10[/tex])

=[tex]\frac{10Cx*990C(10-x)}{1000C10}[/tex]..................equation (2)

Calculate for each term As x=1,2,3...8,9,10.

Hence

P(X=1)=9.23*10^-3  P(X=5)=3*10^-11     P(X=9)=3.8*10^-21

P(X=2)=8.4*10^-5   P(X=6)=1.5*10^-13   P(X=10)=3.8*10^-21

P(X=3)=6.9*10^-7   P(X=7)=6.1*10^-16

P(X=4)=4.9*10^-9   P(X=8)=1.9*10^-18

So Ada will have 10 products and Cathy will have 10 products

Namely,

1,2,3,4,5.......18,19,20.

So suppose 1 product is same between them will be ,

both will have 1 product so remaining will be 19 products.

Jaccard similarity =1/19

Distance to reach 1 to 19th product will be 18

So Euclidean distance =[tex]\sqrt{18}[/tex]

For next when they will 2 products in same remaining will be 18

Jaccard similarity =2/18

And Distance to reach  2 to 18 th product will be  16

Euclidean distance =[tex]\sqrt{16}[/tex]

Similar for  other

When

x=3 dist(Ada, Cathy)=[tex]\sqrt{14}[/tex]

x=4 dist(Ada, Cathy)=[tex]\sqrt{12}[/tex]

x=5  dist(Ada, Cathy)=[tex]\sqrt{10}[/tex]

x=6  dist(Ada, Cathy)=[tex]\sqrt{8}[/tex]

x=7  dist(Ada, Cathy)=[tex]\sqrt{6}[/tex]

x=8  dist(Ada, Cathy)=[tex]\sqrt{4}[/tex]

x=9  dist(Ada, Cathy)=[tex]\sqrt{2}[/tex]

x=10  dist(Ada, Cathy)=[tex]\sqrt{0}[/tex]

This sqrt(0) means both are holding same products hence they are at same point on the graph so distance with itself will be zero.

Now the Probability of distance of dist(Ada,Bob)>dist(Ada,cathy) will be

=multiplying both the probabilities equations (Adding each term probabilities and multiplying )

=Equation(1) *Equation( 2).

=Summation Of P(3≤x≤10)*summation of P(1≤x≤10)

=4.7*10^-9.

In larger number of product event of in large space ,it is difficult( less likely)  that they will chose same product .