~ 文章, Blog, 教程...

Airflow is a popular pipeline orchestration tool for Python that allows users to configure complex (or simple!) multi-system workflows that are executed in parallel across any number of workers. A single pipeline might contain bash, Python, and SQL operations. With dependencies specified between tasks, Airflow knows which ones it can run in parallel and which ones must run after others. Airflow is written in Python and users can add their own operators with custom functionality, doing anything Python can do.


今年 PyCon 上出现的仙器,多后端/Pythonic 工作流/管道管理平台 )

I was preparing to push some changes a couple of days ago and as I usually do, I ran the tests. I sat back in my chair as the dots raced across the screen when suddenly I noticed that one of the dots linger. ”OS is probably running some updates in the background or something” I said to myself, and ran the tests again just to be sure. I watched closely as the dots filled the screen and there it was again?—?I have a slow test!


Matrix-样 数据观察形式看来是正确的 )

Good Tutorial on using pdb.


..It is not necessary to use pdb all the time 嗯哼,作者都说的很明白,其实大家都清楚,动用 pdb 的情景都是不得不作 接盘侠 时, 面对纠结在一起的代码时,不得不进行的刺探, 因为没有自信自己在看过所有代码,将思想扭曲为当初那位崩溃的程序猿相同状态后, 是否能恢复清明... )

Talk proposals will be due on 2017 January 3.Poster proposals will be due on 2017 January 3.Tutorial proposals are due on 2017 November 30. Yes, that’s right — tutorial proposals are due in three weeks.

A curated list of awesome Python asyncio frameworks, libraries, software and resources.


是 github 带领中国程序猿重新认识了 awesome 这词儿,

现在最高效的领域技术搜索技巧就是在 github 中搜索 awesome+ 技术名 )

I've recently been exploring the exciting new world of asynchronous I/O libraries in Python 3 – specifically asyncio and curio. These two libraries make some different design choices. This is an essay that I wrote to try to explain to myself what those differences are and why I think they matter, and distill some principles for designing event loop APIs and asynchronous libraries in Python.


Py3 中内建的 asynciocurio 成为两大焦点都在进行折腾... )

An extension for Django admin that makes interface mobile friendly.

Cloud Dataflow provides a fully-managed, autoscaling, serverless execution environment for data pipelines written in Apache Beam. In this article Lak Lakshmanan and Matt Hancher show us how to create a monthly vegetation index from Landsat images, available as a public dataset.


GCP 中的 Cloud Dataflow 支持 Apache Beam 可以发布无主机数据处理流程... )


~ 包/模块/库/片段...

支持使用 XPath 语法来查询 Python ASTs 的 CLI 工具

针对移动端安全测试的 Android 模拟器


怎么看也不象能在 M$ 环境中跑的工具 )

ByteNet for character-level language modelling

Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.



word_forms )

FOSS 心跳分析, 数据来自社区志愿者 :heartbeat:

适用于 Python 2/3 的简易变量绑定与多行输出刷新的库



华南理工大学在校学生,国人作品 ;-)


JET is a different approach to make numeric python substantially faster



嗯哼,更快的 numpy,,,随着数据科学的兴起, 作为基础中的基础作品 numpy 在遭受各种优化/加速 )

A simple Batch Image Downloader using Python and BeautifulSoup.


美汤!? 弃疗... 孟加拉 的老兄弟... )

Package / Module importer for importing code from Jupyter Notebook files (.ipynb)


ipynb 已经开始构建完备的生态了哪... )

Bridge between IRC and Slack


IRC 上古神器, Slack 今墙外仙器.. )


~ 双 11 了,人造节日中可能最费銭的一个,大家都砍了什么?!


comments powered by Disqus