awesome-data-analysis is a comprehensive, curated collection of over 500 resources designed for both beginners and experts in data analysis and data science. This GitHub repository offers a wealth of information covering essential topics such as Python, SQL, statistics, machine learning, and artificial intelligence. Users can find valuable tools, libraries, roadmaps, cheatsheets, and interview preparation guides. The resource includes sections on data manipulation with Pandas and NumPy, automated EDA and visualization tools, data quality and validation, feature engineering, and specialized data tools. It also provides extensive resources for SQL and databases, data visualization, dashboards, web scraping, mathematics, statistics, A/B testing, time series analysis, data engineering, NLP, MLOps, and cloud platforms.
Best used for
Ideal for data scientists and students who need to learn new data analysis techniques, find relevant Python libraries, and prepare for data science interviews. Especially valuable for those seeking a structured roadmap and curated resources across various data science domains.
Common actions