📊 Data & Analysisbeginnercsvdata-analysisinsightsstatistics

Analyze CSV Data and Surface Insights

Paste CSV data and get a structured analysis of key patterns, outliers, correlations, and actionable insights.

The Prompt

prompt.txt
Analyze the following CSV data. Provide:
1. **Data overview:** Number of rows, columns, data types, and any quality issues (nulls, inconsistencies)
2. **Key statistics:** Mean, median, min, max for numeric columns; distribution for categorical
3. **Top insights:** 5-7 most interesting patterns, trends, or anomalies
4. **Correlations:** Any strong relationships between columns worth investigating
5. **Outliers:** Significant outliers and possible explanations
6. **Recommended visualizations:** What charts would best show this data
7. **Follow-up analyses:** 3 questions this data raises that would be worth investigating

CSV data:
```
[PASTE YOUR CSV HERE — or describe the columns and paste a sample]
```

Context: [WHAT THIS DATA IS — e.g., Monthly sales by region Q1-Q4 2025. Goal is to find underperforming regions.]

Example Output

Analysis of 2,400-row sales dataset: identified Q3 as an outlier quarter (23% above trend) driven by a single large deal in the Northeast. Found strong negative correlation (-0.78) between response time and deal close rate. Flagged 12 rows with anomalous negative revenue (likely returns not properly coded).

FAQ

Which AI model is best for Analyze CSV Data and Surface Insights?

Claude Sonnet 4 or GPT-4o — both handle structured data analysis well. Gemini 2.5 Pro for very large datasets.

How do I use the Analyze CSV Data and Surface Insights prompt?

Copy the prompt, replace the [BRACKETED] placeholders with your specific information, and paste into your preferred AI assistant (ChatGPT, Claude, Gemini, etc.). Analysis of 2,400-row sales dataset: identified Q3 as an outlier quarter (23% above trend) driven by a single large deal in the Northeast. Found strong negative correlation (-0.78) between response time and deal close rate. Flagged 12 rows with anomalous negative revenue (likely returns not properly coded).

Model Recommendation

Claude Sonnet 4 or GPT-4o — both handle structured data analysis well. Gemini 2.5 Pro for very large datasets.