This seems like a simple data manipulation operation, but I am stuck on it.
I have a recommendation dataset for a campaign.
Masteruserid content
1 100
1 101
1 102
2 100
2 101
2 110
Now, for each user we want to recommend at least 5 content items. For instance, Masteruserid 1 has three recommendations, so I want to pick the remaining two at random from the globally viewed content, which is a separate dataset (list). I also have to check for duplicates, in case a randomly picked item is already present in the raw dataset.
global_content
100
300
301
101
In the actual data I have more than 4000 Masteruserids. I would like some help on how to start approaching this.
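
To make the goal concrete, here is a rough sketch of the operation in plain Python, using only the standard library. It is just one possible starting point, not a definitive solution. The function name top_up and the parameters minimum and seed are made up for illustration, and it assumes that "duplicate" means a content id the same user already has.

```python
import random
from collections import defaultdict

# Sample data copied from the question: (Masteruserid, content) pairs.
recommendations = [(1, 100), (1, 101), (1, 102), (2, 100), (2, 101), (2, 110)]
global_content = [100, 300, 301, 101]


def top_up(recommendations, global_content, minimum=5, seed=None):
    """Hypothetical helper: pad each user's list up to `minimum` items."""
    rng = random.Random(seed)  # a seed makes the random picks repeatable

    # Group content ids by user, keeping the original order and
    # skipping repeats within a user.
    per_user = defaultdict(list)
    for user, content in recommendations:
        if content not in per_user[user]:
            per_user[user].append(content)

    for user, items in per_user.items():
        missing = minimum - len(items)
        if missing <= 0:
            continue  # this user already has enough recommendations
        # Assumption: only global items the user does not already have
        # are eligible, which is what rules out duplicates.
        candidates = [c for c in dict.fromkeys(global_content) if c not in items]
        # Sample without replacement; if too few candidates exist,
        # take all of them rather than fail.
        items.extend(rng.sample(candidates, min(missing, len(candidates))))

    return dict(per_user)


filled = top_up(recommendations, global_content, seed=0)
for user, items in filled.items():
    print(user, items)
```

With the sample data above, only 300 and 301 are left as candidates for either user, so both end up with exactly five items. With 4000+ users the same loop should still be cheap, since each user only needs one pass over the global list.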
