Drexel dragonThe Math ForumDonate to the Math Forum



Search All of the Math Forum:

Views expressed in these public forums are not endorsed by Drexel University or The Math Forum.


Math Forum » Discussions » sci.math.* » sci.stat.math.independent

Topic: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs

Replies: 148   Last Post: May 8, 2012 3:40 AM

Advanced Search

Back to Topic List Back to Topic List Jump to Tree View Jump to Tree View   Messages: [ Previous | Next ]
Ray Koopman

Posts: 3,362
Registered: 12/7/04
Re: Comparative results for {lnL,w,lnLw,x1,x2} & {lnL,w,lnLw,x1,x2,lnLx1,lnLx2}
Posted: May 4, 2012 1:05 AM
  Click to see the message monospaced in plain text Plain Text   Click to reply to this topic Reply

This is a combined reply to two posts.
_______________________

190 djh May 3, 6:35 am

> You wrote:
>

>> "To fix that, divide each count by the sum of the four counts,
>> to get proportions."

>
> Yes I should have thought of this myself - thanks very much for
> correcting me here.
>
> You also wrote:
>

>> "However, all this seems too ad hoc, and I'm uncomfortable with it."
>
> In response, let me ask you to put aside for the moment your basic
> objection to the whole "residual categorization" approach and consider
> the "5-predictor" model whose results were given for study and control
> groups for all six folds in the PDF I sent you offline last night.
>
> If I replace the absolute population counts for categories 00,01,10,11
> in this 5-predictor model with the proportions you just suggested,
> then this 5 predictor model boils down to:
>
> {lnL,x1,x2,w,lnLw} , where w is your "proportional" weighting factor
> rather than my "absolute count" weighting factor.
>
> And I don't see where I would be wrong to say that the factors w and
> lnLw are actually LESS ad hoc than the factors mv=2x1+x2 and lnLmv
> which I used as a pure "guess" in the last iteration that wound up
> working on all six folds EXCEPT the c2 fold.
>
> Furthermore, the fact the factors w and lnLw will allow the 5-
> predictor model to work on the c2 fold AS WELL AS the other five folds
> indicates (to me at least) that IF the residual approach is adopted
> in the first place, then a weighting predictor must be added into any
> model to account for the fact that there are systematic proportional
> differences among the 00,01,10,11 population counts at each length
> interval.


You're saying that the probability of a match in a cell is related
to the relative number of inputs in that cell compared to the three
other cells. I have to ask "Why?" Does that make sense on its own,
or is it a surrogate for something that hasn't yet been identified?
The differences in the input counts are caused by correlation among
the (absolute adjusted) residuals. Why are those residuals correlated?
What controls that correlation? You should add the *cause* of the
correlation, not the *result* of the correlation, to the model.

>
> So, assuming you're willing to let me proceed with the 5-predictor
> model {lnL,x1,x2,w,lnLw}, I have one further question.
>
> Before starting the Stage II "official tests" of this model, should I
> try the 7-predictor model {lnL,x1,x2,w,lnLx1,lnLx2,lnlw} and see how
> it compares to the 5-predictor model {lnL,x1,x2,w,lnLw}? I'm asking
> this because of the partial success we achieved a while back with the
> very simple 3-predictor model {lnL,x1,lnLx1} that you suggested ...
> i.e. mu reasoning here is that if you felt it best to add in the
> factor lnLx1 to this model, then it seems to follow that lnLx1 and
> lnLx2 should be added to the five predictor model {lnL,x1,x2,w,lnLw}
> so as to produce the 7-predictor model
> {lnL,x1,x2,w,lnLx1,lnLx2,lnlw}. What do you think?


I think the product terms need to be clearly significant.

>
> As always, thanks for your patience. Having "wallowed" in the data
> of the "category counts" for a few days now (to use the term that
> you used an email or two ago), I do belive there's good reason to
> think that the proportional weighting predictor w is NOT ad hoc,
> but reflects deep properties of the underlying system (systematic
> proportional differences between the 00,01,10,11 category counts)
> that can and should be tested on their own to get a better idea of
> what's going on.

________________________

191 djh May 3, 11:27 am

> I sent you offline a PDD with comparative results for
> {lnL,w,lnLw,x1,x2} & {lnL,w,lnLw,x1,x2,lnLx1,lnLx2}, where "w"
> is the proportional factor for the category 00, 01, 10, or 11.
>
> "Naked eye" evaluation of the confidence intervals seems to
> indicate that the best chance of success will come from applying
> your specified tests to the results of the 7-predictor model
> {lnL,w,lnLw,x1,x2,lnLx1,lnLx2} for the study and control groups.
>
> If you disagree, please let me know. In the meantime, I will
> start applying the tests to this model so that I can at least
> become fluent in the clerical "moves" required to execute them.


I've looked at the pdf, and I don't so much disagree as wonder what
you're taking as evidence of success, or what you really mean by
success.

A side comment: Odds ratios are meaningful only for dichotomous
predictors. Yes, you can look to see whether the CI includes 1,
but it would be easier (and would avoid what appear to be out-of-
printable-range values) to have CIs for the coefficients themselves,
and (perhaps more importantly, in your case) CIs for the difference
between corresponding coefficients in the two groups. Also, note
that the intercept is really just another coefficient and should be
reported on too.

The 95% CI for a coefficient is coeff +/- 1.96*stderr.
The 95% CI for the difference between two coefficients is
(coeff1 - coeff2) +/- 1.96*sqrt[(stderr1)^2 + (stderr2)^2].

>
> Thanks again, Ray



Date Subject Author
4/1/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/3/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/3/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/6/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/6/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/6/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/7/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/7/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/8/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/8/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/9/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/9/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/9/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/9/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/9/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/10/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/10/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/11/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/11/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/11/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/11/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/11/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/11/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Art Kendall
4/11/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/13/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/13/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/13/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/14/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/14/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/14/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/14/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/14/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/14/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/15/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/15/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/15/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/15/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/16/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/16/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/16/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/16/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/16/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/16/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/16/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/17/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/17/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/17/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/18/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/19/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/19/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
gimpeltf@hotmail.com
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
gimpeltf@hotmail.com
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
gimpeltf@hotmail.com
4/21/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
gimpeltf@hotmail.com
4/21/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/24/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/22/12
Read Our first control group result is precisely how we want AML's program
to behave with control group data ....
Halitsky
4/23/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/24/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/24/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/24/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/24/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/26/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/26/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/26/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/27/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/27/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/27/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/28/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/28/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/28/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/28/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/28/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
gimpeltf@hotmail.com
4/28/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/28/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
gimpeltf@hotmail.com
4/28/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/29/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/29/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/29/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/29/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/29/12
Read Your questions go to the "moment of truth" that Jacques and Arthur
may shortly be facing
Halitsky
4/29/12
Read Re: Your questions go to the "moment of truth" that Jacques and
Arthur may shortly be facing
Halitsky
4/29/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/30/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/30/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/30/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/30/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/30/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/30/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/30/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
5/1/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
5/1/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
5/1/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
5/1/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
5/1/12
Read Results for c1 study and control groups when model run WITHOUT the
four cells for shortest length interval 33-42
Halitsky
5/2/12
Read Results on c2 fold confirm peculiarity of length interval 33-42 seen
in c1 fold results
Halitsky
5/2/12
Read Model with adjusted weighting logic now works on c2 fold! (but PLEASE
review my new weighting logic !)
Halitsky
5/2/12
Read Actually, your original suggestion (input counts) works as well as
what I just posted for the c2 fold
Halitsky
5/3/12
Read Re: Actually, your original suggestion (input counts) works as well
as what I just posted for the c2 fold
Ray Koopman
5/3/12
Read Re: Actually, your original suggestion (input counts) works as well
as what I just posted for the c2 fold
Halitsky
5/3/12
Read Comparative results for {lnL,w,lnLw,x1,x2} & {lnL,w,lnLw,x1,x2,lnLx1,lnLx2}
Halitsky
5/4/12
Read Re: Comparative results for {lnL,w,lnLw,x1,x2} & {lnL,w,lnLw,x1,x2,lnLx1,lnLx2}
Ray Koopman
5/4/12
Read Re: Comparative results for {lnL,w,lnLw,x1,x2} & {lnL,w,lnLw,x1,x2,lnLx1,lnLx2}
Halitsky
5/4/12
Read Your question re significance of proportion involving counts of
01:10:01:11 inputs to Arthur's program
Halitsky
5/4/12
Read Results of both chi-square tests on all six folds
Halitsky
5/4/12
Read Re: Comparative results for {lnL,w,lnLw,x1,x2} & {lnL,w,lnLw,x1,x2,lnLx1,lnLx2}
Halitsky
5/4/12
Read I'm sending the a1 CI table off-line, since the format didn't hold here.
Halitsky
5/5/12
Read Re: I'm sending the a1 CI table off-line, since the format didn't
hold here.
Halitsky
5/5/12
Read Re: I'm sending the a1 CI table off-line, since the format didn't
hold here.
Ray Koopman
5/5/12
Read Re: I'm sending the a1 CI table off-line, since the format didn't
hold here.
Halitsky
5/7/12
Read Re: I'm sending the a1 CI table off-line, since the format didn't
hold here.
Halitsky
5/7/12
Read Re: I'm sending the a1 CI table off-line, since the format didn't
hold here.
Halitsky
5/8/12
Read Re: I'm sending the a1 CI table off-line, since the format didn't
hold here.
Ray Koopman
5/8/12
Read Re: I'm sending the a1 CI table off-line, since the format didn't
hold here.
Halitsky
5/6/12
Read Re: I'm sending the a1 CI table off-line, since the format didn't
hold here.
Ray Koopman
5/3/12
Read Could I ask you to check this 1st test using sum of chi-squares from
study vs control cells ?
Halitsky
5/3/12
Read Re: Could I ask you to check this 1st test using sum of chi-squares
from study vs control cells ?
Ray Koopman
5/3/12
Read Re: Could I ask you to check this 1st test using sum of chi-squares
from study vs control cells ?
Halitsky
5/3/12
Read Could I ask you to check this 1st test using converged likelihoods
for study/control/pooled runs?
Halitsky
5/3/12
Read Re: Could I ask you to check this 1st test using converged
likelihoods for study/control/pooled runs?
Ray Koopman
5/3/12
Read Re: Could I ask you to check this 1st test using converged
likelihoods for study/control/pooled runs?
Halitsky
5/1/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/27/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/28/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Ray Koopman
4/28/12
Read Re: Our first control group result is precisely how we want AML's
program to behave with control group data ....
Halitsky
4/24/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/19/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/20/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/16/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/16/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/16/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/14/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman
4/13/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Halitsky
4/8/12
Read Re: Correct way to normalize an rmsd-based distance metric used in
repeated trials of pairs
Ray Koopman

Point your RSS reader here for a feed of the latest messages in this topic.

[Privacy Policy] [Terms of Use]

© Drexel University 1994-2013. All Rights Reserved.
The Math Forum is a research and educational enterprise of the Drexel University School of Education.