[Translated]72% Of The People I Follow On Twitter Are Men (#4301)

* [Translating]72% Of The People I Follow On Twitter Are Men

Translating by Flowsnow!
72% Of The People I Follow On Twitter Are Men

* update
This commit is contained in:
CHL 2016-08-11 10:39:31 +08:00 committed by Ezio
parent c7000d08cd
commit 12235f6d3c
2 changed files with 83 additions and 88 deletions

View File

@ -1,88 +0,0 @@
Translating by Flowsnow!
72% Of The People I Follow On Twitter Are Men
===============================================
![](https://emptysqua.re/blog/gender-of-twitter-users-i-follow/abacus.jpg)
At least, that's my estimate. Twitter does not ask users their gender, so I [have written a program that guesses][1] based on their names. Among those who follow me, the distribution is even worse: 83% are men. None are gender-nonbinary as far as I can tell.
The way to fix the first number is not mysterious: I should notice and seek more women experts tweeting about my interests, and follow them.
The second number, on the other hand, I can merely influence, but I intend to improve it as well. My network on Twitter should represent of the software industry's diverse future, not its unfair present.
### How Did I Measure It?
I set out to estimate the gender distribution of the people I follow—my "friends" in Twitter's jargon—and found it surprisingly hard. [Twitter analytics][2] readily shows me the converse, an estimate of my followers' gender:
![](https://emptysqua.re/blog/gender-of-twitter-users-i-follow/twitter-analytics.png)
So, Twitter analytics divides my followers' accounts among male, female, and unknown, and tells me the ratio of the first two groups. (Gender-nonbinary folk are absent here—they're lumped in with the Twitter accounts of organizations, and those whose gender is simply unknown.) But Twitter doesn't tell me the ratio of my friends. That [which is measured improves][3], so I searched for a service that would measure this number for me, and found [FollowerWonk][4].
FollowerWonk guesses my friends are 71% men. Is this a good guess? For the sake of validation, I compare FollowerWonk's estimate of my followers to Twitter's estimate:
**Twitter analytics**
|men |women
:-- |:-- |:--
Followers |83% |17%
**FollowerWonk**
|men |women
:-- |:-- |:--
Followers |81% |19%
Friends I follow |72% |28%
My followers show up 81% male here, close to the Twitter analytics number. So far so good. If FollowerWonk and Twitter agree on the gender ratio of my followers, that suggests FollowerWonk's estimate of the people I follow (which Twitter doesn't analyze) is reasonably good. With it, I can make a habit of measuring my numbers, and improve them.
At $30 a month, however, checking my friends' gender distribution with FollowerWonk is a pricey habit. I don't need all its features anyhow. Can I solve only the gender-distribution problem economically?
Since FollowerWonk's numbers seem reasonable, I tried to reproduce them. Using Python and [some nice Philadelphians' Twitter][5] API wrapper, I began downloading the profiles of all my friends and followers. I immediately found that Twitter's rate limits are miserly, so I randomly sampled only a subset of users instead.
I wrote a rudimentary program that searches for a pronoun announcement in each of my friends' profiles. For example, a profile description that includes "she/her" probably belongs to a woman, a description with "they/them" is probably nonbinary. But most don't state their pronouns: for these, the best gender-correlated information is the "name" field: for example, @gvanrossum's name field is "Guido van Rossum", and the first name "Guido" suggests that @gvanrossum is male. Where pronouns were not announced, I decided to use first names to estimate my numbers.
My script passes parts of each name to the SexMachine library to guess gender. [SexMachine][6] has predictable downfalls, like mistaking "Brooklyn Zen Center" for a woman named "Brooklyn", but its estimates are as good as FollowerWonk's and Twitter's:
|nonbinary |men |women |no gender,unknown
:-- |:-- |:-- |:-- |:--
Friends I follow |1 |168 |66 |173
|0% |72% |28% |
Followers |0 |459 |108 |433
|0% |81% |19% |
(Based on all 408 friends and a sample of 1000 followers.)
### Know Your Number
I want you to check your Twitter network's gender distribution, too. So I've deployed "Proportional" to PythonAnywhere's handy service for $10 a month:
><www.proporti.onl>
The application may rate-limit you or otherwise fail, so use it gently. The [code is on GitHub][7]. It includes a command-line tool, as well.
Who is represented in your network on Twitter? Are you speaking and listening to the same unfairly distributed group who have been talking about software for the last few decades, or does your network look like the software industry of the future? Let's know our numbers and improve them.
--------------------------------------------------------------------------------
via: https://emptysqua.re/blog/gender-of-twitter-users-i-follow/
作者:[A. Jesse Jiryu Davis][a]
译者:[译者ID](https://github.com/译者ID)
校对:[校对者ID](https://github.com/校对者ID)
本文由 [LCTT](https://github.com/LCTT/TranslateProject) 原创编译,[Linux中国](https://linux.cn/) 荣誉推出
[a]: https://disqus.com/by/AJesseJiryuDavis/
[1]: https://www.proporti.onl/
[2]: https://analytics.twitter.com/
[3]: http://english.stackexchange.com/questions/14952/that-which-is-measured-improves
[4]: https://moz.com/followerwonk/
[5]: https://github.com/bear/python-twitter/graphs/contributors
[6]: https://pypi.python.org/pypi/SexMachine/
[7]: https://github.com/ajdavis/twitter-gender-distribution

View File

@ -0,0 +1,83 @@
在推特上我关注的人72%都是男性
===============================================
![](https://emptysqua.re/blog/gender-of-twitter-users-i-follow/abacus.jpg)
至少,这是我的估计。推特并不会询问用户的性别,因此我 [写了一个程序][1] 根据姓名猜测他们的性别。在我的那些关注者中性别分布甚至更糟83%的是男性。据我所知,其他的还不全都是女性。
修正第一个数字并不是什么神秘的事:我注意寻找更多支持我兴趣的女性专家,并且关注他们。
另一方面,第二个数字,我只能只能轻微影响一点,但是我也打算改进下。我在推特上的关系网应该代表的是软件行业的多元化未来,而不是不公平的现状。
### 我应该怎么估算呢
我开始估算我关注的人(推特的上的术语是“朋友”)的性别分布,然后这格外的难。[推特的分析][2]经常显示相反的结果。 一个我的关注者的性别估算结果:
![](https://emptysqua.re/blog/gender-of-twitter-users-i-follow/twitter-analytics.png)
因此,推特的分析将我的关注者分成了三类:男性、女性、未知,并且给我们展示了前面两组的比例。(性别二值化现象在这里并不存在——未知性别的人都集中在组织的推特账号上。)但是我关注的人的性别比例,推特并没有告诉我。 [而这就是可以改进的][3], 然后我开始搜索能够帮我估算这个数字的服务,最终发现了 [FollowerWonk][4] 。
FollowerWonk 估算我关注的人里面有71%的都是男性。这个估算准确吗? 为了准确性我把FollowerWonk和Twitter对我关注的人的进行了估算结果如下
**推特分析**
| | 男性 | 女性 |
| --------- | ---- | ---- |
| **我的关注者** | 83% | 17% |
**FollowerWonk**
| | 男性 | 女性 |
| --------- | ---- | ---- |
| **我的关注者** | 81% | 19% |
| **我关注的人** | 72% | 28% |
FollowerWonk的分析显示我的关注者中81%的人都是男性很接近推特分析的数字。这个结果还说得过去。如果FollowerWonk 和Twitter 在我的关注者的性别比例上是一致的这就表明FollowerWonk对我关注的人的性别估算也应当是合理的。使用FollowerWonk我就能养成估算这些数字的爱好并且做出改进。
然而使用FollowerWonk 检测我关注的人的性别分布一个月需要30美元这真是一个昂贵的爱好。我并不需要FollowerWonk 的所有的功能。我能很经济的解决只需要性别分布的问题吗?
因为FollowerWonk 的估算数字看起来比较合理我试图做一个自己的FollowerWonk 。使用Python和[一些好心的费城人写的Twitter API封装类][5]LCTT译者注Twitter API封装类是由Mike Taylor等一批费城人在github上开源的一个项目我开始下载我所有关注的人和我所有的关注者的简介。我马上就发现推特的比例限制是极少的因此我随机的采样了一部分用户。
我写了一个初步的程序在所有我关注的人的简介中搜索一个和性别相关的代词。例如如果简介中包含了“she”或者“her”这样的字眼可能这就属于一个女性如果简介中包含了“they”或者”then“那么可能这就是性别位置的。但是大多数简介中不会出现这些代词。对于这种简介和性别关联最紧密的信息就是姓名了。例如@gvanrossum的姓名那一栏是“Guido van Rossum”第一姓名是“Guido”这表明@gvanrossum是一个女的。当找不到代词的时候我就使用第一姓名来评估性别估算数字。
我的脚本把每个名字的一部分传到性别检测机中去检测性别。[性别检测机][6]也有可预见的失败比如错误的把“Brooklyn Zen Center”当做一个名叫“Brooklyn”的女性但是它的评估结果与FollowerWonk和Twitter的相比也是很合理的
| | 非男非女 | 男性 | 女性 | 性别未知的 |
| ----- | ---- | ---- | ---- | ----- |
| 我关注的人 | 1 | 168 | 66 | 173 |
| | 0% | 72% | 28% | |
| 我的关注者 | 0 | 459 | 108 | 433 |
| | 0% | 81% | 19% | |
数据基于我所有的408个关注的人和1000个关注者。
### 了解你的数字
我想你们也能检测你们推特关系网的性别分布。所以我每月花费10美元将“Proportional”应用发布到PythonAnywhere这个便利的服务上
><www.proporti.onl>
这个应用可能会在速率上有限制否则会失败因此请温柔的对待它。github上放了源代码[代码][7] ,也有命令行的工具。
是谁代表了你的推特关系网?你还在忍受那些在过去几十年里一直在谈论的软件的不公平的分布组吗?或者你的关系网看起来像软件行业的未来吗?让我们了解我们的数字并且改善他们。
--------------------------------------------------------------------------------
via: https://emptysqua.re/blog/gender-of-twitter-users-i-follow/
作者:[A. Jesse Jiryu Davis][a]
译者:[Flowsnow](https://github.com/Flowsnow)
校对:[校对者ID](https://github.com/校对者ID)
本文由 [LCTT](https://github.com/LCTT/TranslateProject) 原创编译,[Linux中国](https://linux.cn/) 荣誉推出
[a]: https://disqus.com/by/AJesseJiryuDavis/
[1]: https://www.proporti.onl/
[2]: https://analytics.twitter.com/
[3]: http://english.stackexchange.com/questions/14952/that-which-is-measured-improves
[4]: https://moz.com/followerwonk/
[5]: https://github.com/bear/python-twitter/graphs/contributors
[6]: https://pypi.python.org/pypi/SexMachine/
[7]: https://github.com/ajdavis/twitter-gender-distribution