data-science, tools
Tools: Apache Kafka, Apache Iceberg, Apache Spark, Apache Flink, Apache Airflow, Apache Beam, Postgres, Redshift, Kubernetes, Python, Jupyter Notebook