skip to main content
research-article

Conditional Independence Test Based on Residual Similarity

Published: 28 June 2023 Publication History

Abstract

Recently, many regression-based conditional independence (CI) test methods have been proposed to solve the problem of causal discovery. These methods provide alternatives to test CI of x,y given Z by first removing the information of the controlling set Z from x and y, and then testing the independence between the two residuals Rx,Z and Ry,Z. When the residuals are linearly uncorrelated, the independence test between them is nontrivial. With the ability to calculate inner product in high-dimensional space, kernel-based methods are usually used to achieve this goal, but they are considerably time-consuming. In this paper, we test the independence between two linear combinations under linear structural equation model. We show that the dependence between the two residuals can be captured by the difference between the similarity of Rx,Z and Ry,Z and that of Rx,Z and Rr (Rr is an independent copy of Ry,Z) in high-dimensional space. With this result, we provide a new way to test CI based on the similarity between residuals, which is called SCIT — the abbreviation of Similarity-based CI Testing. Furthermore, we develop two versions of the proposal, called Kernel-SCIT and Neural-SCIT, respectively. Kernel-SCIT calculates the similarity by using kernel functions, while Neural-SCIT approximates the upper bound of the similarity by using deep neural networks. In both algorithms, random permutation tests are performed to control Type I error rate. The proposed tests are evaluated on (conditional) independence test and causal discovery with both synthetic and real datasets. Experimental results show that Kernel-SCIT is simpler yet more efficient and effective than the typical existing kernel-based methods HSIC and KCIT in the cases of small sample size, and Neural-SCIT can significantly boost the performance of CI testing when sufficient samples are available. The source code is available at https://github.com/xyw5vplus1/SCIT.

References

[1]
Wicher Pieter Bergsma. 2004. Testing conditional independence for continuous random variables. (2004).
[2]
T. B. Berrett, Y. Wang, R. F. Barber, and R. J. Samworth. 2020. The conditional permutation test for independence while controlling for confounders. Journal of the Royal Statistical Society Series B 82 (2020).
[3]
Ruichu Cai, Zhenjie Zhang, and Zhifeng Hao. 2013. SADA: A general framework to support robust causation discovery. In International Conference on Machine Learning. 208–216.
[4]
J. J. Daudin. 1980. Partial association measures and an application to qualitative regression. Biometrika 67, 3 (1980), 581–590.
[5]
Ilias Diakonikolas and Daniel M. Kane. 2016. A new approach for testing properties of discrete distributions. In 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS’16). IEEE, 685–694.
[6]
Gary Doran, Krikamol Muandet, Kun Zhang, and Bernhard Schölkopf. 2014. A permutation-based kernel conditional independence test. In Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence (Quebec City, Quebec, Canada) (UAI’14). AUAI Press, Arlington, Virginia, USA, 132–141.
[7]
Seth R. Flaxman, Daniel B. Neill, and Alexander J. Smola. 2016. Gaussian processes for independence tests with Non-iid data in causal inference. ACM TIST 7, 2 (2016), 22–1.
[8]
Kenji Fukumizu, Arthur Gretton, Xiaohai Sun, and Bernhard Schölkopf. 2007. Kernel measures of conditional dependence. Advances in Neural Information Processing Systems 20, 1 (2007), 167–204.
[9]
Arthur Gretton, Karsten M. Borgwardt, Malte Rasch, Bernhard Schölkopf, and Alex J. Smola. 2006. A kernel method for the two-sample-problem. In Advances in Neural Information Processing Systems. 513–520.
[10]
Arthur Gretton, Olivier Bousquet, Alexander Smola, and Bernhard Schölkopf. 2005. Measuring statistical dependence with Hilbert-Schmidt norms. In Proceedings Algorithmic Learning Theory. Springer-Verlag, 63–77.
[11]
Moritz Grosse-Wentrup, Dominik Janzing, Markus Siegel, and Bernhard Schölkopf. 2016. Identification of causal relations in neuroimaging data with latent confounders: An instrumental variable approach. NeuroImage 125 (2016), 825–833.
[12]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Delving deep into rectifiers: Surpassing human-level performance on ImageNet classification. In ICCV. 1026–1034. DOI
[13]
Yue He, Peng Cui, Zheyan Shen, Renzhe Xu, Furui Liu, and Yong Jiang. 2021. DARING: Differentiable causal discovery with residual independence. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD -21). 596–605.
[14]
W. Hoeffding. 1994. A non-parametric test of independence. Annals of Mathematical Stats 19, 4 (1994).
[15]
Kurt Hornik, Maxwell Stinchcombe, and Halbert White. 1989. Multilayer feedforward networks are universal approximators. Neural Networks 2, 5 (1989), 359–366. DOI:
[16]
Diederik P. Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7–9, 2015, Conference Track Proceedings.
[17]
Sanghack Lee and Vasant G. Honavar. 2017. Self-discrepancy conditional independence test. In Uncertainty in Artificial Intelligence, Vol. 33.
[18]
Takeru Miyato, Toshiki Kataoka, Masanori Koyama, and Yuichi Yoshida. 2018. Spectral normalization for generative adversarial networks. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30–May 3, 2018, Conference Track Proceedings. OpenReview.net. https://openreview.net/forum?id=B1QRgziT-.
[19]
Judea Pearl. 2009. Causality. Cambridge University Press.
[20]
Judea Pearl and Dana MacKenzie. 2018. The Book of Why: The New Science of Cause and Effect. Basic Books.
[21]
Joseph D. Ramsey. 2014. A scalable conditional independence test for nonlinear, non-Gaussian data. arXiv preprint arXiv:1401.5031 (2014).
[22]
C. R. Rao. 2002. Karl Pearson Chi-Square Test the Dawn of Statistical Inference. Goodness-of-Fit Tests and Model Validity.
[23]
K. Sachs, O. Perez, D. A. Lauffenburger, and G. P. Nolan. 2005. Causal protein-signaling networks derived from multiparameter single-cell data. Science 308 (2005).
[24]
Eric V. Strobl, Kun Zhang, and Shyam Visweswaran. 2017. Approximate kernel-based conditional independence tests for fast non-parametric causal discovery. arXiv preprint arXiv:1702.03877 (2017).
[25]
Liangjun Su and Halbert White. 2008. A nonparametric Hellinger metric test for conditional independence. Econometric Theory 24, 04 (2008), 829–864.
[26]
D. H. Wolpert and W. G. MacReady. 1997. No free lunch theorems for optimization. IEEE Transactions on Evolutionary Computation 1, 1 (1997), 67–82. DOI:
[27]
Hao Zhang, Shuigeng Zhou, and Jihong Guan. 2018. Measuring conditional independence by independent residuals: Theoretical results and application in causal discovery. In AAAI Conference on Artificial Intelligence.
[28]
Hao Zhang, Shuigeng Zhou, Kun Zhang, and Jihong Guan. 2017. Causal discovery using regression-based conditional independence tests. In AAAI. 1250–1256.
[29]
Hao Zhang, Shuigeng Zhou, Kun Zhang, Jihong Guan, and Ji Zhang. 2021. Testing independence between linear combinations for causal discovery. In AAAI.
[30]
Kun Zhang and Aapo Hyvärinen. 2009. On the identifiability of the post-nonlinear causal model. In Proceedings of the Twenty-fifth Conference on Uncertainty in Artificial Intelligence. AUAI Press, 647–655.
[31]
K. Zhang, J. Peters, D. Janzing, and B. Schölkopf. 2011. Kernel-based conditional independence test and application in causal discovery. AUAI Press, Corvallis, OR, USA, 804–813.
[32]
Qinyi Zhang, Sarah Filippi, Arthur Gretton, and Dino Sejdinovic. 2018. Large-scale kernel methods for independence testing. Statistics and Computing 28 (2018), 113–130.

Cited By

View all
  • (2023)Knowledge Graph Embedding for Topical and Entity Classification in Multi-Source Social Network DataProceedings of the 2023 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining10.1145/3625007.3627315(530-537)Online publication date: 6-Nov-2023
  • (2023)Multi-Label Feature Selection Via Adaptive Label Correlation EstimationACM Transactions on Knowledge Discovery from Data10.1145/360456017:9(1-28)Online publication date: 10-Aug-2023
  • (2023)CDSC: Causal Decomposition Based on Spectral ClusteringInformation Sciences10.1016/j.ins.2023.119985(119985)Online publication date: Dec-2023
  • Show More Cited By

Index Terms

  1. Conditional Independence Test Based on Residual Similarity

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Knowledge Discovery from Data
    ACM Transactions on Knowledge Discovery from Data  Volume 17, Issue 8
    September 2023
    348 pages
    ISSN:1556-4681
    EISSN:1556-472X
    DOI:10.1145/3596449
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 June 2023
    Online AM: 25 April 2023
    Accepted: 10 April 2023
    Revised: 12 February 2023
    Received: 14 September 2022
    Published in TKDD Volume 17, Issue 8

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Causal discovery
    2. Conditional independence test
    3. Residual similarity

    Qualifiers

    • Research-article

    Funding Sources

    • National Key R&D Program of China
    • National Natural Science Foundation of China (NSFC)
    • China Postdoctoral Science Foundation
    • Guangdong Basic and Applied Basic Research Foundation
    • NSFC

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)189
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 18 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Knowledge Graph Embedding for Topical and Entity Classification in Multi-Source Social Network DataProceedings of the 2023 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining10.1145/3625007.3627315(530-537)Online publication date: 6-Nov-2023
    • (2023)Multi-Label Feature Selection Via Adaptive Label Correlation EstimationACM Transactions on Knowledge Discovery from Data10.1145/360456017:9(1-28)Online publication date: 10-Aug-2023
    • (2023)CDSC: Causal Decomposition Based on Spectral ClusteringInformation Sciences10.1016/j.ins.2023.119985(119985)Online publication date: Dec-2023
    • (2023)Penalized model-based clustering of complex functional dataStatistics and Computing10.1007/s11222-023-10288-233:6Online publication date: 25-Aug-2023

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media