Network analysis of questionnaire data

Source: The 24th International Workshop on Matrices and Statistics(第

Author: Markus Mattsson

Institution: University of Helsinki, Finland

Published: 2015, issue 5

Keywords: Network analysis; correlation matrix; partial correlation matrix; Markov Random Fie

Abstract

In the human sciences, the way people think, feel and act is thought to reflect their psychological properties. Data are often gathered using questionnaires, and different forms of factor analysis are used as the default analysis method. Factor analysis is based on the idea that the covariation of variables related to different forms of human behavior reflects the underlying psychological properties. But could we do without latent variables? In this presentation, I discuss the results of modeling data from a psychometric questionnaire as a Markov Random Field (MRF), in which the different forms of behavior function as random variables. The analysis is based on forming an undirected, weighted network that is encoded in a weight matrix.

As a first stage of the analysis (not yet related to MRFs), it is useful to take the raw correlations among the variables as weights and to represent the weight matrix as a graph. Correlations can serve as weights because they are undirected and weighted quantities, with zero correlation representing no linear relationship. This first step is useful for obtaining an overall understanding of the interrelationships of the variables in the data. The correlation matrix, however, suffers from the problem of confounding: the observed correlation between any two variables may be due to both of them being related to one or more other variables in the data.

Because of this, as the second stage of the analysis, a partial correlation matrix is calculated. Its off-diagonal entries are the negated, standardized entries of the inverse of the correlation matrix, and they can also be obtained from a set of linear regressions of each variable on all the others. Under the multivariate normality assumption, two variables are conditionally independent if their partial correlation (conditioning on all other variables included in the network) is zero. This holds at the population level, but in sample data exact zeroes are unlikely.
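The link between the partial correlation matrix and the inverse of the correlation matrix can be illustrated with a minimal sketch. It is not the author's code: the three items X, Y, Z and their correlations are hypothetical, and the helper names `invert` and `partial_correlations` are made up for this example. It uses the standard identity that the partial correlation of variables i and j, given all the others, equals -p_ij / sqrt(p_ii * p_jj), where P is the inverse of the correlation matrix.

```python
def invert(matrix):
    """Invert a small square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(matrix)
    # Augment the matrix with the identity: [A | I] is reduced to [I | A^-1].
    aug = [list(map(float, row)) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        # Pick the row with the largest entry in this column as the pivot row.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        pivot_value = aug[col][col]
        aug[col] = [v / pivot_value for v in aug[col]]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def partial_correlations(corr):
    """Partial correlation of each pair of variables given all the others."""
    precision = invert(corr)
    n = len(corr)
    return [[1.0 if i == j else
             -precision[i][j] / (precision[i][i] * precision[j][j]) ** 0.5
             for j in range(n)]
            for i in range(n)]


# Hypothetical correlations among three questionnaire items X, Y, Z.
# r_XZ = 0.3 is fully explained by Y, because 0.3 = r_XY * r_YZ = 0.6 * 0.5.
corr = [[1.0, 0.6, 0.3],
        [0.6, 1.0, 0.5],
        [0.3, 0.5, 1.0]]

pcor = partial_correlations(corr)
for row in pcor:
    print(["%.3f" % value for value in row])
```

In this constructed case the raw correlation between X and Z is 0.3, yet their partial correlation given Y vanishes (up to floating-point rounding), so the X-Z edge would disappear from the network. The zero arises only because the hypothetical values were chosen to satisfy r_XZ = r_XY * r_YZ; estimates computed from sample data would generally be small but nonzero.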
For this reason, the partial correlations are calculated by estimating the corresponding regression models with the adaptive least absolute shrinkage and selection operator (adaptive LASSO) estimator. In adaptive LASSO estimation, a penalized likelihood is maximized, with the strength of the penalty chosen using the extended Bayesian Information Criterion (eBIC). Small partial correlations then shrink to zero, with the aim of converging on the hypothesized population-level Markov Random Field model.

The network thus formed can be described using various centrality and clustering coefficients. When working with weighted networks, many of these coefficients can be calculated from the connection weights, even though they were originally formulated for the unweighted case. The issue of calculating such coefficients based on either the presence of connections or their weights is discussed. Finally, from the teacher's perspective, the network models show promise as a teaching tool in the behavioral sciences: the graphs can serve as a visual aid that allows students to obtain an intuitive understanding of high-dimensional, complex data.
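The difference between computing a coefficient from the presence of connections and computing it from their weights can be made concrete with the simplest centrality measure. The sketch below is an illustration only: the four-item weight matrix is hypothetical, and `degree_and_strength` is a name made up for this example. It assumes that degree counts nonzero edges and that strength sums absolute edge weights, which is one common convention for weighted networks.

```python
def degree_and_strength(weights):
    """Return (degree, strength) lists for a symmetric weight matrix.

    degree[i]   = number of nonzero edges attached to node i (presence only)
    strength[i] = sum of absolute edge weights attached to node i
    """
    degree = [sum(1 for j, w in enumerate(row) if j != i and w != 0)
              for i, row in enumerate(weights)]
    strength = [sum(abs(w) for j, w in enumerate(row) if j != i)
                for i, row in enumerate(weights)]
    return degree, strength


# Hypothetical sparse partial-correlation network over items A, B, C, D.
labels = ["A", "B", "C", "D"]
weights = [[0.0,  0.1, 0.1,  0.1],
           [0.1,  0.0, 0.0, -0.5],
           [0.1,  0.0, 0.0,  0.0],
           [0.1, -0.5, 0.0,  0.0]]

degree, strength = degree_and_strength(weights)
for name, d, s in zip(labels, degree, strength):
    print(f"{name}: degree={d}, strength={s:.2f}")
```

Here item A has the most connections (three weak edges), while B and D have fewer connections but a larger total weight because of the single strong edge between them. Which item counts as most central therefore depends on whether presence or weight is used.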
