首页 > 新书资源
新书资源(2011年4月)

Statistical bioinformatics : a guide for life and biomedical science researchers / edited by Jae K. Lee. — Oxford : Wiley-Blackwell, c2010. – (58.17115/S797)

Contents

    CONTENTS
    
    PREFACE
    CONTR1B UTORS
    1 ROAD TO STATISTICAL BIOINFORMATICS
    Challenge 1: Multiple-Comparisons Issue
    Challenge 2: High-Dimensional Biological Data
    Challenge 3: Small-n and Large-p Problem
    Challenge 4: Noisy High-Throughput Biological Data
    Challenge 5: Integration of Multiple, Heterogeneous Biological Data Information
    References
    2 PROBABILITY CONCEPTS AND DISTRIBUTIONS FOR ANALYZING LARGE BIOLOGICAL DATA
    2.1 Introduction
    2.2 Basic Concepts
    2.3 Conditional Probability and Independence
    2.4 Random Variables
    2.5 Expected Value and Variance
    2.6 Distributions of Random Variables
    2.7 Joint and Marginal Distribution
    2.8 Multivariate Distribution
    2.9 Sampling Distribution
    2.10 Summary
    3 QUALITY CONTROL OF HIGH-THROUGHPUT BIOLOGICAL DATA
    3.1 Sources of Error in High-Throughput Biological Experiments
    3.2 Statistical Techniques for Quality Control
    3.3 Issues Specific to Microarray Gene Expression Experiments
    3.4 Conclusion
    References
    4 STATISTICAL TESTING AND SIGNIFICANCE FOR LARGE BIOLOGICAL DATA ANALYSIS
    4.l Introduction
    4.2 Statistical Testing
    4.3 Error Controlling
    4.4 Real Data Analysis
    4.5 Concluding Remarks
    Acknowledgments
    References
    5 CLUSTERING: UNSUPERVISED LEARNING IN LARGE BIOLOGICAL DATA
    5.1 Measures of Similarity
    5.2 Clustering
    5.3 Assessment of Cluster Quality
    5.4 Conclusion
    References
    6 CLASSIFICATION: SUPERVISED LEARNING WITH HIGH-DIMENSIONAL BIOLOGICAL DATA
    6.1 Introduction
    6.2 Classification and Prediction Methods
    6.3 Feature Selection and Ranking
    6.4 Cross-Validation
    6.5 Enhancement of Class Prediction by Ensemble Voting Methods
    6.6 Comparison of Classification Methods Using High-Dimensional Data
    6.7 Software Examples for Classification Methods
    References
    7 MULTIDIMENSIONAL ANALYSIS AND VISUALIZATION ON LARGE BIOMEDICAL DATA
    7.1 Introduction
    7.2 Classical Multidimensional Visualization Techniques
    7.3 Two-Dimensional Projections
    7.4 Issues and Challenges
    7.5 Systematic Exploration of Low-Dimensional Projections
    7.6 One-Dimensional Histogram Ordering
    7.7 Two-Dimensional Scatterplot Ordering
    7.8 Conclusion
    References
    8 STATISTICAL MODELS, INFERENCE, AND ALGORITHMS FOR LARGE BIOLOGICAL DATA ANALYSIS
    8.1 Introduction
    8.2 Statistical/Probabilistic Models
    8.3 Estimation Methods
    8.4 Numerical Algorithms
    8.5 Examples
    8.6 Conclusion
    References
    9 EXPERIMENTAL DESIGNS ON HIGH-THROUGHPUT BIOLOGICAL EXPERIMENTS
    9.1 Randomization
    9.2 Replication
    9.3 Pooling
    9.4 Blocking
    9.5 Design for Classifications
    9.6 Design for Time Course Experiments
    9.7 Design for eQTL Studies
    References
    10 STATISTICAL RESAMPLING TECHNIQUES FOR LARGE BIOLOGICAL DATA ANALYSIS
    10.1 Introduction
    10.2 Resampling Methods for Prediction Error Assessment and Model Selection
    10.3 Feature Selection
    10.4 Resampling-Based Classification Algorithms
    10.5 Practical Example: Lymphoma
    10.6 Resampling Methods
    10.7 Bootstrap Methods
    10.8 Sample Size Issues
    10.9 Loss Functions
    10.10 Bootstrap Resampling for Quantifying Uncertainty
    10.11 Markov Chain Monte Carlo Methods
    10.12 Conclusions
    References
    11 STATISTICAL NETWORK ANALYSIS FOR BIOLOGICAL SYSTEMS AND PATHWAYS
    11.1 Introduction
    11.2 Boolean Network Modeling
    11.3 Bayesian Belief Network
    11.4 Modeling of Metabolic Networks
    References
    12 TRENDS AND STATISTICAL CHALLENGES IN GENOMEW1DE ASSOCIATION STUDIES
    12.1 Introduction
    12.2 Alleles, Linkage Disequilibrium, and Haplotype
    12.3 International HapMap Project
    12.4 Genotyping Platforms
    12.5 Overview of Current GWAS Results
    12.6 Statistical Issues in GWAS
    12.7 Haplotype Analysis
    12.8 Homozygosity and Admixture Mapping
    12.9 Gene Gene and Gene x Environment Interactions
    12.10 Gene and Pathway-Based Analysis
    12.11 Disease Risk Estimates
    12.12 Meta-Analysis
    12.13 Rare Variants and Sequence-Based Analysis
    12.14 Conclusions
    Acknowledgments
    References
    13 R AND BIOCONDUCTOR PACKAGES IN BIOINFORMATICS: TOWARDS SYSTEMS BIOLOGY
    13.1 Introduction
    13.2 Brief overview of the Bioconductor Project
    13.3 Experimental Data
    13.4 Annotation
    13.5 Models of Biological Systems
    13.6 Conclusion
    13.7 Acknowledgments
    References
    INDEX