Professional Context
Balancing the daily grind of data analysis with the urgency of outbreak investigations, epidemiologists must navigate the tension between meticulous research and rapid response, all while ensuring the accuracy and reliability of their findings. As the pressure to deliver timely insights mounts, the luxury of thorough investigation is often at odds with the need for swift action, forcing epidemiologists to make tough choices about where to focus their attention.
💡 Expert Advice & Considerations
Don't bother using Gemini to generate flashy reports, focus on using it to automate the tedious data cleaning and processing tasks that suck up 90% of your time.
Advanced Prompt Library
4 Expert PromptsOutbreak Investigation Data Preparation
Using the provided dataset of COVID-19 cases from the past 6 months, perform the following tasks: 1) clean and preprocess the data by handling missing values, converting date fields to a standard format, and normalizing the geographic location fields; 2) generate a set of descriptive statistics, including mean, median, and standard deviation for key variables such as age, sex, and symptom severity; 3) create a preliminary visualization of the outbreak's spatial distribution using a heatmap or scatter plot; and 4) identify the top 5 most significant factors associated with increased risk of hospitalization using a logistic regression model. Assume the data is stored in a Google BigQuery dataset named 'covid19_cases' and the results should be written to a new dataset named 'outbreak_investigation'.
Epidemiologic Study Design and Sampling Strategy
Design a retrospective cohort study to investigate the association between exposure to air pollution and the incidence of respiratory disease in a population of children under the age of 12. Specify the following: 1) the study population and inclusion/exclusion criteria; 2) the sampling frame and method (e.g., random sampling, stratified sampling); 3) the exposure assessment strategy (e.g., using satellite data, air quality monitors); 4) the outcome measurement and case definition; and 5) the proposed sample size and power calculation. Assume a minimum of 1000 participants and a maximum of 5000 participants, and provide a detailed justification for the chosen sample size.
Disease Surveillance and Monitoring Dashboard
Create a real-time dashboard to monitor and track the spread of influenza across different regions using Google Data Studio. The dashboard should include the following components: 1) a geographic map displaying the current outbreak hotspots; 2) a time-series plot showing the weekly incidence rates over the past 12 weeks; 3) a bar chart comparing the vaccination coverage across different age groups; and 4) a table listing the top 5 regions with the highest incidence rates. Use data from the Google Cloud COVID-19 dataset and incorporate data from the CDC's FluView dashboard. Provide a detailed description of the data sources, visualization choices, and any necessary data transformations.
Meta-Analysis of Intervention Effectiveness
Conduct a systematic review and meta-analysis of the literature on the effectiveness of mask-wearing in preventing the transmission of respiratory viruses. Perform the following tasks: 1) search and screen the literature using PubMed and Google Scholar; 2) extract and code the relevant study characteristics, including study design, population, intervention, and outcome measures; 3) assess the quality of the included studies using the Cochrane Risk of Bias Tool; 4) conduct a random-effects meta-analysis to estimate the pooled effect size; and 5) explore the sources of heterogeneity using subgroup analyses and meta-regression. Provide a detailed description of the search strategy, inclusion/exclusion criteria, and any necessary data transformations.