<?xml version="1.0" encoding="utf-8" ?>
<rss version="2.0">
<channel>
<title>Columbia University Biostatistics Technical Report Series</title>
<copyright>Copyright (c) 2013 Columbia University All rights reserved.</copyright>
<link>http://biostats.bepress.com/columbiabiostat</link>
<description>Recent documents in Columbia University Biostatistics Technical Report Series</description>
<language>en-us</language>
<lastBuildDate>Mon, 18 Mar 2013 13:50:27 PDT</lastBuildDate>
<ttl>3600</ttl>








<item>
<title>Meta-Analysis of “Sparse” Data: Perspectives From the Avandia Cases</title>
<link>http://biostats.bepress.com/columbiabiostat/art18</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art18</guid>
<pubDate>Mon, 23 Apr 2012 17:31:32 PDT</pubDate>
<description>
	<![CDATA[
	<p>Combining the results of multiple small trials to increase accuracy and statistical power, a technique called meta-analysis has become well established and increasingly important in medical studies, particularly in connection with new drugs. When the data are sparse, as they are in many such cases, certain accepted practices, applied reflexively by researchers, may be misleading because they are biased and for other reasons. We illustrate some of the problems by examining a meta-analysis of the connection between the diabetes drug Avandia (rosiglitazone) and myocardial infarction that was strongly criticized as misleading, but led to thousands of lawsuits being filed against the manufacturer and the FDA acting to restrict access to the drug. Our scrutiny of the Avandia meta-analysis is particularly appropriate because it plays an important role in ongoing litigation, has been sharply criticized, and has been subject to a more searching review in court than meta-analyses of other drugs.</p>

	]]>
</description>

<author>Michael O. Finkelstein et al.</author>


</item>






<item>
<title>Formulas for Correcting the Mean and Variance of the Full-Data Sample Mean of the Primary Endpoint Under the Dose Selected at Stage One in a Two-Stage Trial with Selection Between Two Active Doses</title>
<link>http://biostats.bepress.com/columbiabiostat/art17</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art17</guid>
<pubDate>Wed, 15 Sep 2010 17:25:36 PDT</pubDate>
<description>
	<![CDATA[
	<p>Formulas are provided in the normal distribution case to correct the mean and variance of the sample mean from a two-stage phase II study in which a preferred dose is selected in the first stage, the second stage collects additional data with the selected dose, and then a non-superiority hypothesis is tested with the data from both stages.</p>

	]]>
</description>

<author>Bruce Levin</author>


</item>






<item>
<title>Losses To Follow-Up In WARSS Collaboration Datasets: A Detailed Statistical Presentation of the Imputation Procedures</title>
<link>http://biostats.bepress.com/columbiabiostat/art16</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art16</guid>
<pubDate>Sat, 05 Dec 2009 16:06:54 PST</pubDate>
<description>
	<![CDATA[
	<p>This document prospectively records the procedures which will be used for handling losses to follow-up (LTF) in statistical analyses of WARSS data.  They were developed by B Levin Ph.D. (WARSS senior statistical consultant) and JLP Thompson Ph.D. (WARSS statistician), and have been approved by the SOCC (Statistical Oversight and Coordinating Committee of the WARSS collaboration).  They have been accepted by the WARSS Principal Investigator (JP Mohr M.D.), and also by the Principal Investigators of APASS, PICSS, HAS, and GENESIS for use in these collaborating studies.</p>

	]]>
</description>

<author>John L.P. Thompson et al.</author>


</item>






<item>
<title>Appendix to &quot;Bush v. Gore: Two Neglected Lessons from a Statistical Perspective&quot; by Michael O. Finkelstein and Bruce Levin</title>
<link>http://biostats.bepress.com/columbiabiostat/art15</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art15</guid>
<pubDate>Sun, 22 Feb 2009 15:34:13 PST</pubDate>
<description>
	<![CDATA[
	<p>This is the appendix to the article entitled, "Bush v. Gore: Two Neglected Lessons from a Statistical Perspective" which was published in Jurimetrics 44(2):181-200 (2004).</p>

	]]>
</description>

<author>Michael O. Finkelstein et al.</author>


</item>






<item>
<title>On a Conjecture of Bechhofer, Kiefer, and Sobel for the Levin-Robbins-Leu Binomial Subset Selection Procedures</title>
<link>http://biostats.bepress.com/columbiabiostat/art14</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art14</guid>
<pubDate>Sun, 27 Jan 2008 14:17:44 PST</pubDate>
<description>
	<![CDATA[
	<p>We state a general formula which provides a lower bound for the probability of various types of acceptable subset selection with the Levin-Robbins-Leu binomial subset selection procedure without elimination or recruitment.  We prove the truth of a conjecture of Bechhofer, Kiefer, and Sobel for this procedure by applying the general lower bound.  We also introduce a simple modification that allows sequential elimination of inferior populations and recruitment of superior populations.  Numerical evidence indicates that the new procedure also obeys the general lower bound while reducing the expected number of observations and failures compared with non-adaptive methods.</p>

	]]>
</description>

<author>Cheng-Shiun Leu et al.</author>


</item>






<item>
<title>Handling Missing Data by Deleting Completely Observed Records </title>
<link>http://biostats.bepress.com/columbiabiostat/art13</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art13</guid>
<pubDate>Thu, 19 Apr 2007 10:08:36 PDT</pubDate>
<description>
	<![CDATA[
	<p>When data are missing, analyzing records that are completely observed may cause bias or ineffciency. Existing approaches in handling missing data include likelihood, imputation and inverse probability weighting. In this paper, we propose three estimators inspired by deleting some completely observed data in regression setting. First, we generate artificial observation indicators that are independent of outcome given observed data and draw inferences conditioning on the artificial observation indicators. Second, we propose a closely related weighting method. The proposed weighting method has more stable weights than those of the inverse probability weighting method (Zhao and Lipsitz, 1992). Third, we improve the e±ciency of the proposed weighting estimator by subtracting the projection of the estimating function onto the nuisance tangent space. When data are missing completely at random, we show that the proposed estimators have asymptotic variances smaller than or equal to the variance of the estimator obtained from using completely observed records only. Asymptotic relative effciency computation and simulation studies indicate that the proposed weighting estimators are more e±cient than the inverse probability weighting estimators under wide range of practical situations especially when the missingness propor- tion is large.</p>

	]]>
</description>

<author>Cuiling Wang et al.</author>


</item>






<item>
<title>An Improved Kernel Assisted Imputation Method in Missing Covariate Regression</title>
<link>http://biostats.bepress.com/columbiabiostat/art12</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art12</guid>
<pubDate>Thu, 19 Apr 2007 10:04:26 PDT</pubDate>
<description>
	<![CDATA[
	<p>There are several methods of handling missing data in a non-likelihood framework, imputation and the inverse probability weighting method being the two main approaches. These methods require auxiliary mod- els, namely the probability of observation for the inverse probabil- ity and the conditional distribution of missing data and their correct specification. Wang and Wang (2001) proposed a kernel method for these auxiliary models and investigated the relationship among various kernel-assisted methods and showed some asymptotic equivalence. In this paper we delve into some questions arisen from Wang and Wang (2001). We first derive an improved imputation method which subtracts the projection of the original imputation score onto the nuisance tangent space. Further, we look into the asymptotic behavior of our method and do a comparison with those described in Wang and Wang (2001). Our results are in contrast with those of Wang and Wang (2001) in that the projection from the improved imputation method becomes negligible when the conditional expectation is estimated. It turns out that under the conditions where our projection becomes negligible, the projection from the inverse probability estimating function is of nonnegligible order.</p>
<p>Keywords: kernel estimator, Imputation methods, Improved imputation methods, Semiparametric methods</p>

	]]>
</description>

<author>Hui Zhang et al.</author>


</item>






<item>
<title>Importance Sampling Estimators for Cox Regression With Missing Covariates</title>
<link>http://biostats.bepress.com/columbiabiostat/art11</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art11</guid>
<pubDate>Wed, 18 Apr 2007 12:08:59 PDT</pubDate>
<description>
	<![CDATA[
	<p>Missingness in covariates is a common problem in survival data. In this article, we propose an importance sampling method for estimating the regression parameters in the proportional hazards model with missing covariates. We also consider the augmented importance sampling method by subtracting the projection term onto the nuisance tangent space. The proposed methods provide consistent and asymptotically normally distributed estimators when the missing-data mechanism depends on outcome variables as well as the observed covariates. Simulation results indicate that the proposed importance sampling estimators are more efficient than the inverse probability weighting estimators for the regression coefficients of the missing covariates, and equally as efficient as or more efficient than the inverse probability weighting estimators for the regression coefficients of the completely observed covariates.</p>

	]]>
</description>

<author>Qiang  Xu et al.</author>


</item>






<item>
<title>On Calculating Maximum Rank One Underapproximations for Positive Arrays</title>
<link>http://biostats.bepress.com/columbiabiostat/art10</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art10</guid>
<pubDate>Fri, 30 Mar 2007 20:13:30 PDT</pubDate>
<description>
	<![CDATA[
	<p>Given P, a rectangular array with positive elements, a rank one underapproximation for P is given by two positive vectors, say r and s, such that each component of rs' is no greater than the corresponding component of P, whence P can then be written as P=(pi)rs'+(1-pi)D for some constant pi, where the residual matrix D is non-negative.  A maximal rank one underapproximation for P is such that pi is maximized over all possible rank one underapproximations for P.</p>
<p>We provide an algorithm for calculating the maximum rank one underapproximation and corresponding pi.  We present an explicit expression in the special case of 2 x c tables, and show that the algorithm yields the correct solution in this case.</p>
<p>Note: this report was originally entitled "Technical Report No. B-48, January 1985" in the Columbia Biostatistics Tech Report Series.  Due to early font styles, strict and non-strict inequalities may be difficult to distinguish in the scanned version.  Magnifying the typeface will help.</p>

	]]>
</description>

<author>Bruce Levin</author>


</item>






<item>
<title>Quadratic Distances On Probabilities: A Unified Foundation</title>
<link>http://biostats.bepress.com/columbiabiostat/art9</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art9</guid>
<pubDate>Thu, 29 Mar 2007 07:22:17 PDT</pubDate>
<description>
	<![CDATA[
	<p>This work builds a unified framework for the study of quadratic form distance measures as they are used in assessing the goodness of fit of models. Many important procedures have this structure, but the theory for these methods is dispersed and incomplete. Central to the statistical analysis of these distances is the spectral decomposition of the kernel that generates the distance. We show how this determines the limiting distribution of natural goodness of fit tests. Additionally, we develop a new notion, the spectral degrees of freedom of the test, based on this decomposition. The degrees of freedom are easy to compute and estimate, and can be used as a guide in the construction of useful procedures in this class.</p>

	]]>
</description>

<author>Bruce G. Lindsay et al.</author>


</item>






<item>
<title>A Least Squares Estimation in Truncated Linear Regression</title>
<link>http://biostats.bepress.com/columbiabiostat/art8</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art8</guid>
<pubDate>Thu, 22 Mar 2007 12:21:51 PDT</pubDate>
<description>
	<![CDATA[
	<p>We investigate least squares estimation for regression coefficients of the covariates in the multiple linear regression model with truncated data and propose an alternative consistent least squares type estimator to the existing ones.  The estimator is proved to have an asymptotic normal distribution with the same asymptotic variance matrix as the estimator proposed by Lai and Ying (1992b). However, the estimator is much simpler in computation than Lai and Ying's estimator.  The estimation procedure does not require calculation of the nonparametric estimate of the error distribution. A simulation study shows that the estimator performs well even with a moderate sample size.</p>

	]]>
</description>

<author>Wei Yann Tsai et al.</author>


</item>






<item>
<title>On Minimizing the Lower Bound for the Probability of Acceptable Subset Selection</title>
<link>http://biostats.bepress.com/columbiabiostat/art7</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art7</guid>
<pubDate>Fri, 15 Dec 2006 14:29:00 PST</pubDate>
<description>
	<![CDATA[
	<p>This report identifies the configuration that minimizes the (s,t)-lower bound for the probability of theta-acceptable subset selection, which is involved in the proof of a conjecture of Bechhofer, Kiefer, and Sobel for the Levin-Robbins-Leu binomial subset selection procedure with sequential elimination and recruitment.</p>

	]]>
</description>

<author>Bruce Levin</author>


</item>






<item>
<title>Proof of the Lower Bound Formula for the Probability of Correct Binomial Subset Selection with the Levin-Robbins-Leu Sequential Elimination and Recruitment Procedure in the Case b=2, c=4.</title>
<link>http://biostats.bepress.com/columbiabiostat/art6</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art6</guid>
<pubDate>Fri, 15 Dec 2006 14:17:55 PST</pubDate>
<description>
	<![CDATA[
	<p>We prove the validity of a lower bound for the probability of correct subset selection for choosing the best b=2 binomial populations out of c=4.  The procedure considered combines sequential elimination of inferior populations and simultaneous sequential recruitment of superior populations.  This extends the results of Leu and Levin (2004) to this new procedure.</p>

	]]>
</description>

<author>Cheng-Shiun Leu et al.</author>


</item>






<item>
<title>Decision trees for instantaneous hazard rates with adjustment for covariate effects</title>
<link>http://biostats.bepress.com/columbiabiostat/art5</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art5</guid>
<pubDate>Tue, 27 Jun 2006 17:25:46 PDT</pubDate>
<description>
	<![CDATA[
	<p>We investigate the problem of finding confidence sets for a threshold in the baseline hazard function of the Cox proportional hazards model. The aim  is to find an effective way of condensing information in the baseline into a small number of estimable parameters. A binary decision tree is used as a working model,  with the jump point (threshold) representing a time at which baseline risk  changes abruptly between two levels. We find the asymptotic distribution of estimators of best-fitting  parameters under an arbitrary misspecification of  the working model.   The estimators converge at cube-root rate to a non-normal limit distribution.  Two alternate ways of constructing confidence intervals for the threshold are compared.  Results from a simulation study and an example concerning a threshold for the age of onset of schizophrenia in a large cohort study are discussed.</p>

	]]>
</description>

<author>Moulinath  Banerjee et al.</author>


</item>






<item>
<title>Technical Report #B-92---Formulas for the Exact Probability of Correct Selection in the Binomial Levin-Robbins Sequential Selection Procedure in the Cases b=2, c=3 and b=2, c=4 for r=1</title>
<link>http://biostats.bepress.com/columbiabiostat/art4</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art4</guid>
<pubDate>Thu, 08 Jun 2006 19:01:22 PDT</pubDate>
<description>
	<![CDATA[
	<p>The purpose of this report is to record some explicit expressions for the probability of correct selection in the Levin-Robbins procedure for selecting the best b out of c coins in the two cases in which such expressions are feasible.</p>

	]]>
</description>

<author>Bruce Levin et al.</author>


</item>






<item>
<title>Tech Report #B-91---Selecting the Best Subset of b Out of c Coins with the Levin-Robbins Sequential Elimination Procedure:  Proof of the Lower Bound Formula for the Probability of Correct Selection in the Case b=2 and c=4.</title>
<link>http://biostats.bepress.com/columbiabiostat/art3</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art3</guid>
<pubDate>Thu, 08 Jun 2006 18:54:19 PDT</pubDate>
<description>
	<![CDATA[
	<p>The purpose of this report is to prove the validity of the lower bound formula  for a sequential identification procedure with elimination of inferior coins in the special case of selecting the best b = 2 out of c = 4 coins for any r greater than or equal to 1. This is the first non-trivial case of a general conjecture not already proven in previous work and suggests a rigorous proof of the general conjecture may yet be possible.</p>

	]]>
</description>

<author>Cheng-Shiun Leu et al.</author>


</item>






<item>
<title>A Generalization of the Levin-Robbins Procedure for Binomial Subset Selection and Recruitment Problems</title>
<link>http://biostats.bepress.com/columbiabiostat/art2</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art2</guid>
<pubDate>Thu, 11 May 2006 14:17:46 PDT</pubDate>
<description>
	<![CDATA[
	<p>We introduce a family of sequential selection and recruitment procedures for the subset identification problem in binomial populations.  We demonstrate the general validity of a simple formula providing a lower bound for the probability of correct identification in a version of the family without sequential elimination or recruitment.  A new application of the noncentral hypergeometric distribution is revealed.  A similar theorem is conjectured to hold for the more efficient version which employs sequential elimination or recruitment.</p>
<p>Keywords: lower bound formula, probability of correct selection, recruitment, selection, sequential identification.</p>

	]]>
</description>

<author>Cheng-Shiun Leu et al.</author>


</item>






<item>
<title>A Note on the Censoring Problem in Empirical Case-Outcome Studies</title>
<link>http://biostats.bepress.com/columbiabiostat/art1</link>
<guid isPermaLink="true">http://biostats.bepress.com/columbiabiostat/art1</guid>
<pubDate>Wed, 10 May 2006 13:38:18 PDT</pubDate>
<description>
	<![CDATA[
	<p>In studies of the legal system investigators may collect information about cases within a study window and compile statistical information about their outcomes.  Because there is frequently a long delay between the start time for cases and their resolution, a significant number of cases may be pending at the close of the study window.   If there is a correlation between the outcome variable and being censored, exclusion of censored cases may bias the analysis in the sense that the reported outcomes will be systematically different from what would be reported if all the censored cases were followed to completion and included in the data.  A prime example, which we will use to illustrate our approach, is the landmark study of reversals in death penalty cases in the state courts that was authored by a team led by Professor James S. Liebman of Columbia Law School.  Two equivalent ways of estimating outcome rates accounting for the censored cases is the subject of this article.</p>

	]]>
</description>

<author>Michael  O.  Finkelstein et al.</author>


</item>





</channel>
</rss>
