Semiparametric regression
In: Cambridge series on statistical and probabilistic mathematics
6 Ergebnisse
Sortierung:
In: Cambridge series on statistical and probabilistic mathematics
In: Decision sciences, Band 14, Heft 2, S. 253-269
ISSN: 1540-5915
ABSTRACTThe purpose of this research is to show the usefulness of three relatively simple nonlinear classification techniques for policy‐capturing research where linear models have typically been used. This study uses 480 cases to assess the decision‐making process used by 24 experienced national bank examiners in classifying commercial loans as acceptable or questionable. The results from multiple discriminant analysis (a linear technique) are compared to those of chi‐squared automatic interaction detector analysis (a search technique), log‐linear analysis, and logit analysis. Results show that while the four techniques are equally accurate in predicting loan classification, chi‐squared automatic interaction detector analysis (CHAID) and log‐linear analysis enable the researcher to analyze the decision‐making structure and examine the "human" variable within the decision‐making process. Consequently, if the sole purpose of research is to predict the decision maker's decisions, then any one of the four techniques turns out to be equally useful. If, however, the purpose is to analyze the decision‐making process as well as to predict decisions, then CHAID or log‐linear techniques are more useful than linear model techniques.
In: Political analysis: PA ; the official journal of the Society for Political Methodology and the Political Methodology Section of the American Political Science Association, Band 25, Heft 2, S. 223-240
ISSN: 1476-4989
Media-based event data—i.e., data comprised from reporting by media outlets—are widely used in political science research. However, events of interest (e.g., strikes, protests, conflict) are often underreported by these primary and secondary sources, producing incomplete data that risks inconsistency and bias in subsequent analysis. While general strategies exist to help ameliorate this bias, these methods do not make full use of the information often available to researchers. Specifically, much of the event data used in the social sciences is drawn from multiple, overlapping news sources (e.g., Agence France-Presse, Reuters). Therefore, we propose a novel maximum likelihood estimator that corrects for misclassification in data arising from multiple sources. In the most general formulation of our estimator, researchers can specify separate sets of predictors for the true-event model and each of the misclassification models characterizing whether a source fails to report on an event. As such, researchers are able to accurately test theories on both the causes of and reporting on an event of interest. Simulations evidence that our technique regularly outperforms current strategies that either neglect misclassification, the unique features of the data-generating process, or both. We also illustrate the utility of this method with a model of repression using the Social Conflict in Africa Database.
Media-based event data—i.e., data comprised from reporting by media outlets—are widely used in political science research. However, events of interest (e.g., strikes, protests, conflict) are often underreported by these primary and secondary sources, producing incomplete data that risks inconsistency and bias in subsequent analysis. While general strategies exist to help ameliorate this bias, these methods do not make full use of the information often available to researchers. Specifically, much of the event data used in the social sciences is drawn from multiple, overlapping news sources (e.g., Agence France-Presse, Reuters). Therefore, we propose a novel maximum likelihood estimator that corrects for misclassification in data arising from multiple sources. In the most general formulation of our estimator, researchers can specify separate sets of predictors for the true-event model and each of the misclassification models characterizing whether a source fails to report on an event. As such, researchers are able to accurately test theories on both the causes of and reporting on an event of interest. Simulations evidence that our technique regularly out performs current strategies that either neglect misclassification, the unique features of the data-generating process, or both. We also illustrate the utility of this method with a model of repression using the Social Conflict in Africa Database.
BASE
In: Risk analysis: an international journal, Band 17, Heft 3, S. 321-332
ISSN: 1539-6924
Exposure‐response analysis of acute noncancer risks should consider both concentration (C) and duration (T) of exposure, as well as severity of response. Stratified categorical regression is a form of meta‐analysis that addresses these needs by combining studies and analyzing response data expressed as ordinal severity categories. A generalized linear model for ordinal data was used to estimate the probability of response associated with exposure and severity category. Stratification of the regression model addresses systematic differences among studies by allowing one or more model parameters to vary across strata denned, for example, by species and sex. The ability to treat partial information addresses the difficulties in assigning consistent severity scores. Studies containing information on acute effects of tetrachloroethylene in rats, mice, and humans were analyzed. The mouse data were highly uncertain due to lack of data on effects of low concentrations and were excluded from the analysis. A model with species‐specific concentration intercept terms for rat and human central nervous system data improved fit to the data compared with the base model (combined species). More complex models with strata denned by sex and species did not improve the fit. The stratified regression model allows human effect levels to be identified more confidently by basing the intercept on human data and the slope parameters on the combined data (on a C × T plot). This analysis provides an exposure–response function for acute exposures to tetrachloroethylene using categorical regression analysis.
In: Communications in statistics. Theory and methods, Band 25, Heft 11, S. 2615-2632
ISSN: 1532-415X