R2 DAY2-03 Information extraction with Python - jiawei chen (PyCon APAC 2015)
Speaker: jiawei chen
This talk will present a named entity recognition (NER) system for extracting attributes and values, like person, company, place or time, from various of text data. I will introduce how to combine several python tools to build this system. First, use a python written annotation tool BRAT to create a custom annotated corpus. Second, use python to link CRFsuite, training a Conditional Random Fields model to labeling our list of text data, the labeling result will be further analyzed by pandas and scikit-learn.
About the speaker
A search engineer, usually like to study machine learning and natural language processing.
Chia-Feng is a software engineer for cloud computing in ITRI. He works on cloud computing for more than 3 years and on OpenStack for 1 year. His primary interest is in distributed storage system and high availability for cloud computing.
Chia-Feng 是一位工研院的雲端運算軟體工程師。他接觸 OpenStack 已有一年的時間,並在雲端運算領域工作了三年以上。他主要的興趣在於雲端運算中的分散式儲存系統與服務的高可用性。
...
https://www.youtube.com/watch?v=FWhqcHf_JK8
Remember Reuven Lerner, the guest from our last episode?
Apart from his extensive experience in lecturing Python, he’s also a speaker in PyCon US 2022.
In his talk at the education summit, he shared how he was inspired by the similarity between being a Chinese learner and the learning journey his students experienced while taking Python courses. And he had utilized this concept to improve his teaching technique.
If you are on the path to mastering a programming language. You must not miss the advice he gave in this episode!
---
Guess what?
Reuven offer free online courses for those who are starting as a beginner. Go check out his course on his website now!
[Reuven's online course]: https://lerner.co.il/
---
留言告訴我你對這一集的想法: https://open.firstory.me/user/ckmkshy2r2s5a0823k3c1t8uz/comments
Follow “PyCon Taiwan”
⭐️ Official Website: https://tw.pycon.org
⭐️ Facebook: https://www.facebook.com/pycontw
⭐️ Instagram: https://www.instagram.com/pycontw
⭐️ Twitter: https://twitter.com/PyConTW
⭐️ LinkedIn: https://www.linkedin.com/company/pycontw
⭐️ Blogger: https://pycontw.blogspot.com
...
https://www.youtube.com/watch?v=FBnuLdokcwc
Day 1, R3 11:30–12:00
Have you ever thought the origin of Python packaging tools like distutils, easy_install, setuptools, pip, tox, venv, pipenv, and poetry? In the first section, you will learn the history of “Python Software Distribution” and the reason behind them.
Do you know how those mature solution in big company handle the Python packaging problems before those community’s solutions? In the second section, you will learn the big company’s efforts, lessons learned, and why did they decide to shift to community solution.
If you are interested in the history of python packaging or the story behind the big company's build-system, this is the right talk for you.
Slides: https://www.slideshare.net/ssuser2cbb78/time-travel-lets-learn-from-the-history-of-python-packaging-238397814
Speaker: Kir Chou
A code monkey builds search services in Amazon jungle. This will be the 4th year of his presence in PyCon TW.
在亞馬遜做搜索服務的碼猴,今年將會是它出現在PyCon台灣的第四年。
...
https://www.youtube.com/watch?v=TvFwG2VkpFU
Speaker: Zaki Akhmad
We can never rely on network firewall to be secure. We also must have a secure application. Besides test the functionality of the application, we must also test the security of the application. While the latter is frequently not performed hence the first is considered more important.
In this 25 minute talk, I'll share my experience using python for application security testing: from SQL injection, brute force attack, identifying and cracking password hashes, to proxy-ing the network traffic: intercept and modify it; and also doing network forensic.
About the speaker
Just another Python enthusiasts
Mainly use Python for application security testing
Planet Python Indonesia maintainer
Python Indonesia meetup organizer
...
https://www.youtube.com/watch?v=nMxiKcgoWZo
PyCon Taiwan 2023|Talk 演講|Day 1, R1 16:00–16:45
? 說明 Description ?
A layered data design pattern is a modern data architecture for building ETL/ELT data pipelines comprised of multiple stages so that each stage processes the data and improves the quality of the data progressively. Compared to the imperative way how data engineers build ETL/ELT data pipelines in the last decade, layered data architecture could be of great help in improving data quality steadily and progressively, and reducing data silos while project-specific teams are autonomously producing various data products. We will introduce, in this share, a technical solution based on layered data architecture. The solution is implemented by means of Dagster, a cloud-native data orchestrator with integrated lineage, observability, and a declarative programming model. A simple example will be presented in this talk to demonstrate concepts, principles, and data stack of the solution. In the end, the benefits we have gained from the implementation experience will be conveyed as well.
? 投影片 Slides:https://1drv.ms/p/s!AtNklwocKzYg8AEp6C6A-0jq8XN6?e=3ZFwU6
? 講者介紹 About Speaker - George T. C., Lai ?
A data practitioner with data analysis background who has been developing career mainly in Big Data and DevOps based on cloud-native ecosystem for 12 years. In the recent 7 years, I have been focusing on Data Architect, team management, and DevOps. As to technical experience, I got 6 years on Hadoop ecosystem, especially on Hortonworks HDP, 7 years on Kubernetes and 4 years on AWS/GCP. My personal vision is to make each data practitioner have a better life. I am approaching the vision by exploring new tools, discovering best practices, and delivering well-designed data architectures and technical solutions for data practitioners to relief their pain points and frustrations when coping with data.
Follow “PyCon Taiwan”
⭐️ Official Website: https://tw.pycon.org
⭐️ Facebook: https://www.facebook.com/pycontw
⭐️ Instagram: https://www.instagram.com/pycontw
⭐️ Twitter: https://twitter.com/PyConTW
⭐️ LinkedIn: https://www.linkedin.com/company/pycontw
⭐️ Blogger: https://conf.python.tw/
...
https://www.youtube.com/watch?v=wyO8VSkH81o
Day 3, R2 13:00–13:30
I have been using Ren'Py for eight years in Doujin activities. Ren'Py is Python based visual novel game engine. In the past eight years, I felt many benefits using Ren'Py. This talk demonstrates the benefits of using it. The author also describes how Ren'Py evolved from the user's point of view.
Slides not uploaded by the speaker.
Speaker: Daisuke Saito
Daisuke Saito is a assistant professor of the School of Fundamental Science and Engineering, Waseda University in Japan. He acquired a Doctor of Engineering degree from Waseda University in Japan. His research interests include programming education and digital game-based learning.
...
https://www.youtube.com/watch?v=bzkhPxLvk58
Day 1, R0 16:10–16:55
Apache Kafka is considered as a distributed streaming platform to a build real-time data pipelines and streaming apps. You can also take Kafka as commit log service with functions much like a publish/subscribe messaging system, but with better throughput, built-in partitioning, replication, and fault tolerance and runs in production in thousands of companies. Recently, Kafka has been widely applied as one component of SMACK stack because of it's role connected with Apache Hadoop, Apache Storm, and Spark Streaming in the data pipeline.
In this talk, I will start with introduce data stream processing and the general concept of Kafka's architecture and components by several use cases. Then, Kafka' API will be introduced by python clients with demo. Finally, the benchmark, comparison and limitation of different python clients will be discussed.
本演講將透過使用案例介紹Apache Kafka 的基本架構和組成概念,並藉由python client 的套件說明其API的使用,最後比較不同python client的差異和限制。
The speaker did not upload his slides.
...
https://www.youtube.com/watch?v=BhW9AUL7ea4