Streaming a Million Likes/Second: Real-Time Interactions on Live Video

  Переглядів 44,633

InfoQ

InfoQ

3 роки тому

QCon London International Software Development Conference returns on April 8-10, 2024. Level-up on 15 major software and leadership topics including; The Tech of FinTech, What's Next in GenAI and Large Language Models (LLMs), Performance Engineering, Architecture for the Age of AI, Innovations in Data Engineering and more.
Learn the emerging trends. Explore the use cases. Implement the best practices.
Register now: bit.ly/3rRgsJV
----------------------------------------------------------------------------------------------------------------------------------
Video with transcript included: bit.ly/3nwZkBt
Akhilesh Gupta does a technical deep-dive into how LinkedIn uses the Play/Akka Framework and a scalable distributed system to enable live interactions like likes/comments at massive scale at extremely low costs across multiple data centers.
This presentation was recorded at QCon London 2020: bit.ly/2VfRldq
#LinkedIn #DistributedSystems #Scalability

КОМЕНТАРІ: 59
@vikram_saha7
@vikram_saha7 2 місяці тому
Perfect presentation !! The way in which he delivers the complex design choices in an easy and fun way is amazing!!
@-indeed8285
@-indeed8285 3 роки тому
Enjoyed way he present.. (Proper slide, Great transitions, No breaks, engaged with audience, making jokes and all of that without losing the context.)
@venkatapavankumarsannisett2259
@venkatapavankumarsannisett2259 2 роки тому
Brilliant articulation and to the point. Also for the resources in between. Thanks to infoq for such amazing real world scalable stories from the industry
@codetolive27
@codetolive27 2 роки тому
Amazing presentation. Loved the way he has presented.
@natadilip
@natadilip 2 роки тому
Amazing Explanation. This is clearly one of the best videos on qcon.
@umamahesh8774
@umamahesh8774 3 роки тому
Very well explained. Great Talk. Thanks
@chaitrakeshav
@chaitrakeshav 2 роки тому
Great talk! So informative. Thank you!
@visheshgupta4990
@visheshgupta4990 3 місяці тому
Thank you so much I watched the complete video and it is very helpful 🙌
@naveenjohn3402
@naveenjohn3402 2 роки тому
Wonderful talk. Very engaging and beautifully presented
@kanthipavuluri3874
@kanthipavuluri3874 Рік тому
Totally loved this evolutionary styled presentation, brilliant story telling!
@amolnigade8830
@amolnigade8830 Рік тому
Insightful information on live video interactions
@Rkrish2010
@Rkrish2010 3 роки тому
Awesome many thanks 😍😍☺️👍😊🤗😱
@NK-ju6ns
@NK-ju6ns 2 роки тому
Very engaging still calm and neat presentation. Insightful.
@user-jt5nd3yq4u
@user-jt5nd3yq4u 2 місяці тому
Amazing presentation, learned a lot from it!
@PuneetSethi89
@PuneetSethi89 Рік тому
Great video Akhilesh! It was very clear, easy to follow and educational.
@forfun7700
@forfun7700 2 роки тому
Great video + presentation. Thanks
@srcmake
@srcmake 3 роки тому
Really great presentation.
@muhammadnuri2910
@muhammadnuri2910 3 роки тому
Very useful and clear presentation.
@RahulKumar-lp5jc
@RahulKumar-lp5jc Рік тому
One of the best talks I have seen. Hats off to you. I have a doubt here. In more than 10k connections case, why do we need an additional layer of frontend nodes? Instead, why can't we store the mapping of video and Real-Time Dispatcher node in the distributed key-value store. The Likes backend can query the key-value store to get the dispatcher nodes for sending the like object. Once it reaches to dispatcher node, the node has the in-memory subscription table to send to the corresponding end user.
@AssassinoYunus
@AssassinoYunus 3 роки тому
What is the raw computing power of one single Frontend machine? How many cores and ram?
@preetiirrothi744
@preetiirrothi744 Рік тому
The best presentation i’ve seen!
@iamvasu2698
@iamvasu2698 3 роки тому
Really nice.. but what about DB scaling or Key-value store mapping..
@afrozalam5389
@afrozalam5389 3 роки тому
Great talk dude
@allmhuran
@allmhuran 3 роки тому
How is the key value store scaling? Is it replicated? Presumably the publication of "likes" does not have to be guaranteed, so inconsistency between replicas is acceptable (ie, some likes are not published to active subscribers)? (Edit: OK, second part of this question is covered in the Q&A, verifying the assumption, but is that also the answer to how the key value store scales?) And is that same premise applicable to other elements in the chain? Eg, if a front end times out in response to a publication request, or a client times out in response to a publication request, is it simply dropped or does it need to be retried? Presumably the former?
@yashwanthnerella9935
@yashwanthnerella9935 2 роки тому
Awesome 👏
@ankk98
@ankk98 2 роки тому
Thanks
@raymondsquared
@raymondsquared 3 роки тому
Really professional and easy to follow presentation. Loved it! Whats the benefit of using long polling vs websockets?
@curiousspirit3947
@curiousspirit3947 2 роки тому
curious too
@gxbambu
@gxbambu 2 роки тому
You don't really need websocket here, cuz sending likes and comments don't need to be real time
@rahulverma2523
@rahulverma2523 2 роки тому
@@gxbambu Yea, but they are calling it the "realtime" platform. Also, the platform is used for multiple use cases and not just for sending likes.
@ekamaloff
@ekamaloff 2 роки тому
@@gxbambu the whole point of this presentation was distributing likes and comments in real-time , which is required for live streaming
@akashjain2990
@akashjain2990 5 місяців тому
He kind off explained that in the first question from audience. Reason is : Long polling using server side events are basically plain HTTP requests; so support even very older devices; do not get blocked by firewalls etc. Both of these problems exists with WebSockets.
@javisartdesign
@javisartdesign 3 роки тому
Pretty interesting talk! I would really like to know how they manage idempotence and duplication for likes and messages.
@deformercr6680
@deformercr6680 3 роки тому
Just have an ID for each like I suppose
@javisartdesign
@javisartdesign 3 роки тому
@@deformercr6680 you do not want to store each like as a record separatelly, it must be just an aggregation or counter in database. I can think on a map-reduce operation in the stream
@deformercr6680
@deformercr6680 3 роки тому
@@javisartdesign yes you don't, but you assign it and store it in the client
@javisartdesign
@javisartdesign 3 роки тому
@@deformercr6680 that means you can use different devices and sessions for multiple likes... that is cheating.
@abhijit-sarkar
@abhijit-sarkar 8 місяців тому
How does a client find a frontend server?
@anandt8362
@anandt8362 2 роки тому
Thanks for the video...With regards to Kafka, 1. When the client starts to watch the video server we can establish connection and store [key value ] the connection it. 2. We can have front end nodes subscribe to specific topics[videos] which it is gonna handle 3. Whenever a like happens we push into kafka and the front end nodes which are responsible for sending the like to all the subscribed connections. 4. they can get it from the stored connection key value store and send the like to the clients through akka actors . What is the issue in that ? Please clarify ?
@yushutong722
@yushutong722 2 роки тому
+ 1 this is the part I'm confused. A given frontend server does NOT need to subscribe to topics of every live video, but only to the unique ones that its connections are watching
@gabbah79
@gabbah79 Місяць тому
I was thinking the same thing. Just two years later. ;)
@foreverursabhi
@foreverursabhi 2 роки тому
1. How does the KV store get updated when a front end dies? 2. How do the dispatchers find each other? What happens when one dies? 3. How does dispatcher broadcast prevent infinite loops? (1 publishes to 2 and 3, 2 publishes to 1 and 3, and so on) Why not use gossip protocol instead?
@lazypenguin3156
@lazypenguin3156 2 роки тому
I'm still learning distributed systems but here's what I think. 1. We need a time-to-live on the KV entry (I'd think of the KV as more of a cache than persistent storage bc the data is changing some much that I would not want to write to disk). 2. In the single datacenter model, dispatchers don't talk to other dispatchers. In the multiple datacenter model, I would probably use kafka to fan out instead of letting dispatchers talk to other dispatchers. 3. If I understand your question correctly, I do not think that dispatching an event will trigger another dispatch. I believe we only dispatch when a client sends a message.
@willemvdk4886
@willemvdk4886 2 роки тому
It's amazing how everything in IT boils down to layers of abstraction. Whether you're talking about local device I/O or about large distributed systems. It's abstractions all the way down. Once you realize that it gets so much easier to comprehend.
@karansaluja3315
@karansaluja3315 3 роки тому
In the multi Datacenter scenario, how do you ensure that a node is subscribed to only one dispatcher ?
@_overide
@_overide 2 роки тому
It doesn't have to, node can subscribe to any number of dispatcher. Dispatcher in DC1 simply broadcast likes to all peer dispatcher in other DC and then it get handled as usual.
@marvinalone
@marvinalone 3 роки тому
I think we have 10x viewers on 11/11 when thousands of sellers selling their products online thru live video
@zhakunigor
@zhakunigor 2 роки тому
Why do not include "likes" and "comments" into the stream itself letting client not only show video, but also handle other type of messages in the stream? In this way it's possible to reuse existing video-stream infrastracture fully
@nikolay6700
@nikolay6700 3 роки тому
i do not understand if they use Kafka
@sahilkharidia563
@sahilkharidia563 3 роки тому
Why not add a Dispatcher between client and Frontend Server during Kafka ?
@viczsaurav
@viczsaurav 3 роки тому
The problem will remain. The dispatcher will be bottleneck then and cannot be scaled as each dispatcher would have to consume all the events emanating from all kafka streams. Letting dispatcher decide which events to pull for breaks down the data chunk for it to process through thus providing the multiplicity we see at the end.
@aysarferas2253
@aysarferas2253 Рік тому
macam mana nak buat?
@greyreynyn
@greyreynyn 2 роки тому
lmao, man do they ever feel goofy being like "we have dedicated infrastructure and engineering for a fucking thumbs up button"
@greyreynyn
@greyreynyn 2 роки тому
"I get paid 600k USD a year to let people hit a like button" idk seems goofy to me
@avisheksanthaliya3714
@avisheksanthaliya3714 Рік тому
More like solve the problem "because its a nice engineering problem" than "because it will help the world". TBH I don't care how many people are reacting to a live video.
@lazypenguin3156
@lazypenguin3156 2 роки тому
why introduce the dispatcher? Why not just broadcast to all frontends?
@lazypenguin3156
@lazypenguin3156 2 роки тому
at 47:53 he explains it is a matter of scaling.
@travelingDeveloper
@travelingDeveloper 4 місяці тому
Yeah we can broadcast all likes to all frontend nodes but that won't be efficient, as there might be some frontend nodes in which connections does have only red live videos watchers so sending green live video likes to it doesn't make any sense.
@likoarreola8874
@likoarreola8874 3 роки тому
God and Jesus always
Scaling Facebook Live Videos to a Billion Users
51:31
InfoQ
Переглядів 87 тис.
Scaling Slack - The Good, the Unexpected, and the Road Ahead
37:54
😨Новая Война в GTA 5 Online #shorts
00:40
King Dm
Переглядів 1,6 млн
Evolution of Edge @Netflix
43:02
InfoQ
Переглядів 16 тис.
Why Distributed Systems Are Hard
41:09
InfoQ
Переглядів 34 тис.
Real-Time & Personalized Notifications @Twitter
50:57
InfoQ
Переглядів 19 тис.
Real-Time Delivery Architecture at Twitter
43:34
Parleys
Переглядів 56 тис.
End-to-end Exactly-once Aggregation Over Ad Streams | Yelp
33:57
Data Council
Переглядів 4,8 тис.
Kafka: A Modern Distributed System
52:25
InfoQ
Переглядів 39 тис.
Scaling Pinterest • Marty Weiner • GOTO 2014
45:53
GOTO Conferences
Переглядів 25 тис.
The ARM chip race is getting wild… Apple M4 unveiled
4:07
Fireship
Переглядів 531 тис.
Вы поможете украсть ваш iPhone
0:56
Romancev768
Переглядів 152 тис.