Professional Context
With data accuracy KPIs at 95% and query optimization metrics showing a 20% slowdown in data retrieval, the pressure is on to fine-tune our testing workflows and ensure seamless integration with SQL, Python, and Tableau to meet the precision targets of our regression models.
💡 Expert Advice & Considerations
Don't waste your time trying to use Gemini to replace human intuition in testing - instead, focus on using it to augment your data interpretation and automation workflows, and always verify the results with manual testing.
Advanced Prompt Library
4 Expert PromptsAutomated Test Case Generation for ETL Pipelines
Given a Snowflake database schema with 50 tables and a Python-based ETL pipeline using the pandas library, generate a set of automated test cases to validate data integrity and completeness, including tests for data type consistency, null value handling, and data transformation accuracy, using a combination of SQL queries and Python scripts, and provide a statistical summary of the test results, including pass/fail rates, test coverage, and data quality metrics.
Query Optimization for Data Warehousing
Analyze a Tableau dashboard that is experiencing slow load times due to inefficient SQL queries, and provide a rewritten version of the queries using optimized join techniques, indexing, and data partitioning, along with a step-by-step explanation of the changes made and the expected performance improvements, including metrics such as query execution time, data retrieval speed, and CPU usage reduction.
Data Cleaning Script Development for Machine Learning Models
Develop a Python script using the scikit-learn library to clean and preprocess a dataset for use in a regression model, including handling missing values, outliers, and data normalization, and provide a detailed explanation of the data cleaning steps, including code snippets, data visualizations, and statistical summaries, and evaluate the impact of the cleaning process on the model's precision and recall metrics.
Statistical Analysis of Test Results for Model Validation
Given a set of test results from a regression model, including predicted values, actual values, and residual errors, perform a statistical analysis to validate the model's performance, including calculations of mean absolute error, mean squared error, and R-squared, and provide a detailed interpretation of the results, including data visualizations, correlation analysis, and recommendations for model improvement, using a combination of Python, R, and SQL to analyze the data and generate the statistical summaries.