Question : Count how many people has published paper(s) to more than one topic
Exemple:
Poo
has two papers which is associated toPhysics
only and since it has only one topic, it is not considered to be countedAmy
has one paper with two(2) topics (Physics
,Economics
), so it should be countedBaa
has two paper with distinct topic for each, so it should be counted
Example Dataframes:
| | id | name | has_published_papers |
|---|-----|------|-----------------------------------------|
| 0 | 100 | Amy | pp3524172 |
| 1 | 101 | Bla | pp0120888,pp0343660,pp0151738,pp0120631 |
| 2 | 102 | Foo | pp0134084,pp1262416,pp0120082,pp0117571 |
| 3 | 103 | Boo | pp0274558,pp0108872,pp1796960,pp0117509 |
| 4 | 104 | Soo | pp0120338,pp0993846,pp1375666,pp0407887 |
| 5 | 111 | Poo | pp0152095,pp1234567 |
| 6 | 112 | Baa | pp0237474,pp0152095 |
| | paper_id | name | topics |
|----|-----------|-------------|--------------------|
| 30 | pp3524172 | research A | Physics, Economics |
| 40 | pp0189076 | research B | Math, Physics |
| 55 | pp0237474 | research C | Education |
| 68 | pp2729488 | research D | Physics, Math |
| 79 | pp0152095 | research Z | Physics |
| 99 | pp1234567 | research X | Physics |
Edit
The desired output is an integer
representing the number of people who has published paper(s) to more than one topic