Excel+Py 就是数据科学的未来?

原文: PyCoder's Weekly - Issue #477

pycoder

  • 210616 Zoom.Quiet(大妈) 用时 42 分钟 完成快译
  • 210616 Zoom.Quiet(大妈) 用时 37 分钟 完成格式转抄.

What’s the most widely used tool in data science? Is it pandas or NumPy? Is it the Python language itself? Not really. It’s Excel. You might argue that data scientists aren’t using Excel as their primary tool, and you might be right. But Excel enables non-technical users, like small business owners, to gain insights into their data. In this article, Anaconda CEO Peter Wang discusses his goal of making Python and PyData the “conceptual successor” to Excel.

(是也乎:

对的, 无论 Jupyter 怎么折腾, 如果 Excel 嵌入 Python, 那么瞬间, 没 notebook 什么事儿了...

)

If you need to do some data analysis, what tool do you reach for first? Is it pandas? While pandas is great, it comes with some costs that you might not be aware of, including large memory overhead that can quickly get in the way of your projects. Using databases with SQL can alleviate memory issues. In this tutorial, you’ll learn how to do common data analysis tasks in SQL, which opens the door to mixing SQL and pandas to create lightweight programs that are also fast!

(是也乎:

SQL 永远的语言

)

In this tutorial, you’ll prepare for future interviews by working through a set of Python practice problems that involve CSV files. You’ll work through the problems yourself and then compare your results with solutions developed by the Real Python team.

(是也乎:

常年手工解析 .csv 的俺...

)

(是也乎:

嗯哼? 这个热点蹭得专业哪...

)

(是也乎:

打死也不发布 1.0 党

)


探讨/吐糟

Discussions

Chaining together many object methods can create long tines that break the PEP 8 79-character line length recommendation. You can use \ to break the chain of methods onto individual lines, but if you want to leave comments at the end of some of the lines, you’re out of luck. There’s another pattern, though, that solves this.

Brian Okken, co-host of the Python Bytes podcast, asks Twitter users about internal package distribution, and the Twitterverse responds.

(是也乎:

S3->GCP->Devpi

还是用 Devpi 发布内源 PyPI 最简洁了.

)


文章/教程/嗯哼

Articles, Tutorials and Talks

Follow along with this in-depth analysis of LEGO prices to see Bayesian analysis in action. Along the way, you’ll how pooled and unpooled linear models can be used to determine if a LEGO set is fairly priced. The article is quite technical, so experience with Bayesian statistics is recommended.

(是也乎:

PyMC 原先只是作为课堂讲义, 没想到慢慢的, 变成了 Bayes 标准库了...

)

Community-based programming courses are a great way to introduce folks to computer programming that otherwise may not have the means to do so. One of the barriers to learning to code is cost. You need a computer to program on, after all. But with the advent of tiny computers like the Raspberry Pi, computers aimed at education are more affordable than ever.

(是也乎:

RPi 400

用 RPi 替代主机配合显示屏+mouse+键盘, 形成只能学习编程的环境, 这下学员没办法安装游戏来玩了?

)

With this Python project, you’ll build a script to calculate grades for a class using pandas. The script will quickly and accurately calculate grades from a variety of data sources. You’ll see examples of loading, merging, and saving data with pandas, as well as plotting some summary statistics.

(是也乎:

Gradebook

)

Are you looking for an in-depth data science project to practice your skills on? Perhaps you would like to add new tools to your Python web development projects instead? This week on the show, David Amos is back, and he’s brought another batch of PyCoder’s Weekly articles and projects.

(是也乎:

podcast

Vue 真自成一派了.

)

PEP 661 proposes adding a utility for defining sentinel values in the standard library. In this article, you’ll get a summary of PEP 661, learn what sentinel objects are with real-world examples, and see how to use them with type hints.

In this step-by-step tutorial, you’ll learn how Python’s filter() works and how to use it effectively in your programs. You’ll also learn how to use list comprehension and generator expressions to replace filter() and make your code more Pythonic.

(是也乎:

filter

还是得配合 lambda 才能有麽幻感觉

)


好物/妙品/...

Interesting Projects, Tools and Libraries, Projects & Code

(是也乎:

真小说生成器

novelWriter )

(是也乎:

pyWhat

宇宙版反向 UUID ?

)

(是也乎:

textual

TUI (Text User Interface) 比 GUI 越来越嗯哼了...

rich

基于 Rich/富豪 框架

)

(是也乎:

Random

)


📆🐍 活动/大会

Events, MeetUp 真的是全球线下活动组织中心

(是也乎:

即便是线上的, 一样收费.

)


DAMA

❤️ Happy Pythonic ;-(大妈私人无责任播报)

(是也乎:

独创 logging + debug 模块

)

(是也乎:

老司机开新坑, 时隔10年, 沈游侠再次开声, 值得关注:

Bilibili

)

(是也乎:

私人初体验, 现在 http/https 资源混用浏览器越来越傲娇了, 最好有工具可以统一迁移...

)


PS:


好文笔,感叹号年度配额: 2/3

投稿/反馈邮箱:

askdama@googlegroups.com

(邮件列表地址, 当成正常邮件发送邮件就好, 不用注册, 不用翻越...)


ZoomQuiet/大妈

就是四处 是也乎,( ̄▽ ̄) 的那个大妈:

私自嗯哼: ZoomQuiet (订阅号: ZoomQuiet42)
公开课程: 蟒营 (订阅号: Mainium)
历史吐糟: Chaos42 (订阅号 PythoniCamp)

as 创始组织者:
    PyChina (订阅号: PyChinaOrg)
    本地社区: 
        GDG珠海 (订阅号: GDG-ZhuHai)
        TFUG珠海 (订阅号: ZH_TFUG)


Comments



自怼圈/年度番新

DU21.7
关于 ~ DebugUself with DAMA ;-)

关注公众号, 持续获得相关各种嗯哼:
zoomquiet

追问

任何问题, 随时邮件提问可也:
askdama@googlegroups.com


蟒营®编程思维提高班Python版

**2021.01.11** 因大妈再次创业暂停定期开设, 转换为预约触发:
  • + 扫描预约入群, 学员每满 42 人即启动新一期训练营 ;-)
  • 101camp21.7
  • + 任何问题, 随时邮件提问可也:
    askdama@googlegroups.com