Episode 97, November 6, 2019
Remember when a hard drive that could hold a terabyte of data was a big deal? Well, we’re now in an era where peta-, exa- and even zetta-bytes are the bytes of the day, and it turns out it’s hard to fit that many zeroes on a hard drive. That’s where Dr. Ant Rowstron, Deputy Lab Director of Microsoft Research Cambridge, and Mark Russinovich, Chief Technical Officer of Azure, come in. Their respective teams are working on paradigm-breaking solutions to give us phenomenal storage power in an itty-bitty living space.
Today, Ant and Mark discuss their roles in the development of new optical technologies, like Project Silica, for cloud-scale storage demands, and talk about the Optics for the Cloud Research Alliance, an exciting new collaboration between academic researchers and MSR. They also explain how just the right mix of innovation and engineering can make the cloud more powerful and less expensive to use and, at the same time, deliver “forever” storage that’s both dishwasher and microwave safe!
Host: Welcome to another two-guest episode of the Microsoft Research Podcast, where we’re exploring the latest in disruptive technologies for data storage in the cloud. In an era in which we’re pushing the very limits of our current storage technologies, who better than the CTO of Azure and the Deputy Lab Director of MSR Cambridge to give us the lowdown on the promise of new technologies like quartz glass for storing – and protecting – all things digital?
Host: You’re listening to the Microsoft Research Podcast, a show that brings you closer to the cutting-edge of technology research and the scientists behind it. I’m your host, Gretchen Huizinga.
Host: Remember when the hard drive that could hold a terabyte of data was a big deal? Well, we’re now in an era where peta-, exa- and even zetta-bytes are the bytes of the day, and it turns out it’s hard to fit that many zeroes on a hard drive. That’s where Dr. Ant Rowstron, Deputy Lab Director of Microsoft Research Cambridge, and Mark Russinovich, Chief Technical Officer of Azure, come in. Their respective teams are working on paradigm-breaking solutions to give us phenomenal storage power in an itty-bitty living space.
Today, Ant and Mark discuss their roles in the development of new optical technologies, like Project Silica, for cloud-scale storage demands, and talk about the Optics for the Cloud Research Alliance, an exciting new collaboration between academic researchers and MSR. They also explain how just the right mix of innovation and engineering can make the cloud more powerful and less expensive to use and, at the same time, deliver “forever” storage that’s both dishwasher and microwave safe! That and much more on this episode of the Microsoft Research Podcast.
Host: The word excited is overused, but there’s really not a better word for how I feel about the guests I have in the booth today. First, we have, all the way from Cambridge, UK, Dr. Ant Rowstron, who’s the Deputy Lab Director of Microsoft Research Cambridge, and with him, I have Mark Russinovich, who is the Chief Technical Officer of Microsoft Azure. Welcome Ant and Mark, Mark and Ant!
Ant Rowstron: Thank you.
Mark Russinovich: Thanks.
Host: Your titles give us a picture of who you are at Microsoft, but they don’t really say what you do. And I’d like each of you to give us a little more detail on what gets you up in the morning. Maybe a sort of elevator pitch on what you do and why you do it? Ant, why don’t you give us a start here.
Ant Rowstron: Okay. So, you know, basically, as Deputy Lab Director in Cambridge, I come in every day and I get to work on cloud infrastructure and future technologies for the cloud. And it’s really fun. We’re doing a lot of stuff we’re using a lot of high tech stuff we’re doing a lot of stuff around optics, femtosecond lasers and things like that. And so, yeah, I come in every morning and I get to play with the greatest tech and try to create technologies that will help the cloud in the future.
Host: As a Deputy Lab Director, are you doing research yourself in addition to sort of overseeing the lab work?
Ant Rowstron: I, yeah, so I try to spend some of my time each week actually doing research as well, and I’m very involved in a small number of the projects, and I help oversee a number of the other projects as well in the group.
Host: As some people say, to keep your research muscle strong.
Ant Rowstron: Yeah, that’s right. That’s right. It’s the most exciting thing. It’s the bit I really enjoy, coming in and doing stuff.
Host: Mark, what do you do?
Mark Russinovich: So, my role is leading technical strategy and architecture for Azure, our cloud platform. So, every day is completely different. It could be meeting with customers, meeting with internal partner teams, meeting with engineers deep on Teams, writing papers, reading papers, meeting with analysts. I mean, every day is a new adventure.
Host: So as Chief Technical Officer, how technical do you get? Do you get into the weeds?
Mark Russinovich: I get into the weeds, yeah. That’s the fun part. Um, so, I think that that is really a really great part of the job is that I can be at a very high-level, like strategy, like directions, trends, and then get down into implementation details. In fact, I’m involved with a couple of other projects at MSR Cambridge at a very deep technical level.
Host: Well, perhaps to some of our listeners, you aren’t obvious booth mates, so let’s explain why you are both here in the booth together. There’s an interesting story of how Azure met Optics during a disruptive technology review where researchers present the work they’re doing to senior leadership team members. How did that go down? Ant, why don’t you go first.
Ant Rowstron: Okay. So, I mean, you know, it’s really interesting. We’d been working with the Azure Storage guys for some time on some storage projects, looking at trying to use traditional technologies to sort of store long-term data. And it had got to the stage where I think, you know, both sides had realized that we pushed the existing technologies to the limits, and both sides were looking around for new technologies. And it happened that someone in the Azure storage side and us both saw a sort of a news feed article about a breakthrough at Southampton and how to store data in glass. And we both approached it from very different angles. The Azure storage side, that person flew over to Southampton and had a look at it. We were there downloading papers trying to read them, trying to understand them. As a computer scientist, it was quite hard. And then we started to talk about how the technology could be taken forwards and trying to convince the company that this was going to be something to be worth doing. We started getting ready for this disruptive technology review. We reached out to Mark to help us because it was sort of key to get both sides really bought in to what was happening.
Ant Rowstron: And then we got the opportunity to present it to the SLT and Jason was there and you were there and that really led to the point where people said, hey, let’s make it work together, let’s work collaboratively between Microsoft Research and Azure Storage to get it forward. And so that’s how…
Mark Russinovich: Yeah.
Ant Rowstron: …Mark got involved and how it sort of got going.
Mark Russinovich: Yeah, in fact, I mean, I remember, so meeting with Ant before this where I got my first overview of what the project was all about and the promise of it, the disruption technology review happened. The SLT was like, this looks like really exciting stuff. But that alone isn’t enough to actually cause things to go forward. What happened after that is that me, Ant, Douglas Philips who is in charge of storage, and Jason Zander, head of Azure, met together to talk about how we wanted to take it forward. And we’d kind of estimated with MSR Cambridge the resources required and the amount of money required to actually get the thing moving to a point where it could potentially intersect with production in Azure at some point.
Mark Russinovich: And that was the go big or go home. Either we had to commit, and jointly fund it with MSR Cambridge, or it just wasn’t going to happen. And then we decided to do that.
Ant Rowstron: You know, at the time, we didn’t really have any people who worked in the space of physicists or in the space of optics. So, it really was the decision point: are we actually going to start doing this, and if we are, we’ve got to go out and hire new people with new skills we don’t have, traditionally in MSR, and bring them on board. So it was a big decision to do it and thankfully we made it!
Host: Mark, Microsoft wasn’t first to the cloud, but it’s a big player now and you’re at the technical epicenter of Azure. How have you seen the cloud computing landscape evolve over the time you’ve been here, both from an industry point-of-view and from Microsoft’s point-of-view?
Mark Russinovich: Just in the landscape of what cloud was and what it is now, it’s been a fascinating journey because when I started in Azure in 2010, Azure had been underway for a few years, just commercially launched, but Azure was tiny. Basically, in two datacenters, a few thousand servers. And the number of companies that were saying they were going to do cloud was probably twenty, twenty-five companies. And about four years ago, there were still about seventeen. Three years ago, we were down to thirteen. Two years ago, it was down to six. And there’s still six in what Gartner classifies as their “magic quadrant” for an infrastructure as a service. And I think there’ll be more paring down to come and Azure has been continuously in this strong position rising, rising and a couple of years ago, the market and analysts started to recognize us as the strong number two in this space.
Host: What are the factors that go into making you a player in this field?
Mark Russinovich: It’s a huge amount of capital. It has to be a core business priority for the company.
Mark Russinovich: And expertise from hardware, datacenters, all the way up in the software stacks and massive scale distributed systems. And also time is another factor because this stuff all needs to mature.
Ant Rowstron: I think it’s hard to underestimate the scale of this, right? About six or seven years ago, I was lucky enough to go to Dublin datacenter which is, I guess, by modern standards, a big one, but not the biggest, or at least… and in fact, then it was still being built out, as it were. And we got there, and you sort of walked for just mile after mile or kilometer after kilometer, depending which country you’re in. And you just saw servers. I mean, it’s just room after room after room, and each time you go in it, it is just thousands and thousands and thousands of servers. It was just a moment where, you know, it actually changed the way that I thought about research because there was no point in trying to do the stuff that we were trying to do, because when you realize how it operates, and in that scale, you are just wheeling in the rack, you don’t care. They’re just delivering racks and they have… you know, there’s sort of a production line. You know, racks roll in at one end, it comes through… in a sense, it’s rolled in, plugged in, a few cables, power, and just the scale was just phenomenal. And I think, unless you’ve actually been to one of those datacenters, it’s often hard to sort of appreciate.
Mark Russinovich: Yeah. The ironic thing is that there’s many people that work in Azure that have never been to a datacenter, so they have not experienced…
Ant Rowstron: They should go!
Mark Russinovich: Yeah, I know, they should!
Host: It just reminds me of that final scene of Indiana Jones where they put the Ark of the Covenant in an Army storage house and then the camera pans back and it’s like, you’ll never find it.
Mark Russinovich: Yeah, yeah, yeah.
Ant Rowstron: In fact, you have this great photo, I think, that you show sometimes, which shows sort of like a datacenter from the front and you go, this like the size…
Mark Russinovich: Yeah.
Ant Rowstron: …of a football pitch. And then you show it from the other side and it’s like ten times bigger and you say this is what we’re currently building out to extend that one datacenter. It’s just unbelievable.
Host: I love that Ant says football pitch and he probably means soccer field.
Ant Rowstron: Oh! Soccer field, yeah!
Mark Russinovich: I’ll translate it.
Host: All right. Well let’s talk about where and how we store our digital stuff because this is a huge part of what we’re talking about here. We have “storage units” – and I put air quotes around that right now – for our digital stuff, but they weren’t designed with the cloud in mind. And so, Ant, what are we up against with cloud-size demands on current technologies, is Moore’s Law really dead, and what are researchers exploring in optical technologies to help us out?
Ant Rowstron: If you think of most of the technologies we use to store data today – things like flash, things like hard disk drives, things like tape – it’s true to say that they were all designed before the cloud existed. And in fact, they were all designed to work in multiple scenarios. You know, the flash chips have to work in your phone as well as in the datacenters. Or the hard disk drives have to work with desktop machines as well as in the datacenters. So, there’s compromises that have been made. And, in fact, many of those technologies, if you think of a sort of S-curve where you got time, and you’ve got the metric of interest, you get a new technology comes along like a hard disk drive, it starts off very slowly and then hits its sort of main speed and it’s sort of going nicely. It’s, you know, doubling every two years or three years if you’re a hard disk drive. And then you get to the point where it actually stops just, basically, exploiting the property they were exploiting, and it rolls over and becomes flat. And I think one of the challenges for us as a company, and us as an industry, is that many of the technologies we rely on are beginning to get to the point where either they are at the end, or they’re starting to get to the point where you can see the end. And of course, Moore’s Law is a well-publicized one and we hit it some time ago. And that’s a great opportunity, because whenever you get that rollover, you get an opportunity to be able do things differently, to have new ways of doing things.
Ant Rowstron: I mean, we used FPGAs a lot. And this was something that we used to actually allow us to do stuff on the computational side. On the storage side, we started to look at these optical technologies and to see how we might be able to exploit things and new discoveries in order to create new storage technologies and new storage media that we could use when the traditional ones are reaching the end of their life cycle.
Host: Right. So, I want to drill in just a little bit there because when we talk about technologies that are invented before something else happens, now you have an opportunity with optics to build from the ground up. Does that affect the way you think about designing things?
Ant Rowstron: Yeah.
Host: Does it change your framework?
Ant Rowstron: Yeah, I think it does. I mean, it goes back to this point about going around a datacenter. When you actually see the scale of a datacenter, you start to think about things very differently. You start to think about, what does it mean to actually make something that works in that environment?
Ant Rowstron: And there’s tradeoffs that you can make. Perhaps it’s, you know, something can be a bit bigger. It doesn’t need to be a particular size. It can be long and thin. It can have different dimensions because that’s not the dominating factor is to get it under a desk as well as in the datacenter.
Host: Right, right, right. Mark, do you have anything to add to that?
Mark Russinovich: Well, I think, kind of related to the scale that Azure’s going towards, I think we’re going to talk about the scale of the data that’s coming…
Host: We are.
Mark Russinovich: …to the cloud and being generated in the cloud. And that’s the other dimension here. Once you get to that kind of scale, small optimizations, much less big ones, have massive impact on cost of storage, cost of keeping the media fresh. I mean, when we take a look at the amount of data that’s being generated in the cloud, and the way that that data is managed by existing technologies, which have limited lifetimes, and we’re talking about here in the cloud, there’s going to be data that’s going to be persisted for decades or hundreds of years…
Mark Russinovich: …and media like flash needs to be constantly scrubbed. Tape needs to be refreshed periodically, and if you think about like tape, archival media, petabytes, exabytes, zetabytes of it, having to be constantly read and rewritten…
Mark Russinovich: …that’s part of the cost model in these existing technologies.
Host: Mark, Azure has – to use a poker term – kind of gone all-in on optical technologies and become a leader in this arena, both in thought and deed, as it were, so why is optics important to Azure and where are these leading edge technologies in the product pipeline for you guys? And how does Microsoft compare with other players in the cloud market space as it pertains to optical technologies?
Mark Russinovich: As far as I’m aware, we’re the only cloud that’s pursuing this kind of technology. Now others might be pursuing it and being quiet about it, but I think that that’s a differentiation of Microsoft and Microsoft Research with regard to exploring these things and that is we openly talk about it. And I think that does a few things. One is, it attracts researchers and engineers that are interested in working on cutting-edge technology when they get excited about being able to really disrupt the future by working on a project like that. The other one is that it shows our customers that we’re a cloud that’s constantly innovating and exploring new ideas and looking at ways we can make the cloud, and their usage of the cloud, much more cost–efficient and powerful. So, both of those, I think, are reasons why we are so transparent about the things we work on.
Mark Russinovich: Now as far as where it is in the pipeline, we don’t know exactly when this is going to land in a way that it works at scale in the environments that we’re going to be deploying it in. There’s still a lot of questions to be answered. There’s some fundamental questions that Ant’s still exploring the answers to. And I think that, once we get the answers to the fundamental questions, that’s when you start to engineer for production and get it to the point where you can stamp these things out and deploy them around the world.
Ant Rowstron: One of the great things is, working so closely with Azure means that as we do answer more of these questions and we get more of a handle on it, we’re actually answering the questions that make it more feasible…
Mark Russinovich: Yeah.
Ant Rowstron: …for it to be deployed in Azure. And I think that’s one of these things about working in such close collaboration with the teams that are actually trying to deploy storage today and trying to use storage today, it’s a really powerful thing. You get that feedback loop going. You get people actually sort of able to consume the technology which is often very hard for a new technology.
Mark Russinovich: And then that allows you to zoom in on the answer.
Mark Russinovich: Like an optimal answer for what comes up.
Host: What are those fundamental questions that you still haven’t answered, Ant, and why haven’t you??
Mark Russinovich: Yeah, what’s taking you so long?
Ant Rowstron: That’s right!
Host: Dang it!!
Ant Rowstron: So, you know, we’re at the stage where we’re still trying to really control the technology. We’re still trying to understand how to improve the performance and the way in which it operates. And when we first saw the technology, you walked into a room in Southampton and, you know, they are effectively writing a few bits per second, reading a few bits per second. And you got to get something that is able to do, you know, hundreds of megabytes per second… there’s a long way to go. And to get there, it’s a mixture of engineering and innovation.
Ant Rowstron: So, some of the problems you engineer out, other problems you have to innovate around. And we’re being quite successful at innovating around some things, we’re also being quite successful at engineering things. But we still got a long way to go, I think, to make it really happen. But we can see the path which is always good.
Host: Right. I mean that’s half the battle is seeing where you are going. Well, let’s talk for a second about the balance between pushing the limits of existing technologies, and exploiting recent discoveries in new media like glass. We’ll go deep into Project Silica in a bit, but first, talk briefly about the current state-of-the-art in storage technologies, along with some other explorations across the labs in other media.
Ant Rowstron: Today, we’re using tape, we’re using hard disk drives and we use flash in the cloud. And, you know, a few years ago, the team in Cambridge worked very hard at trying to build out the lowest possible cost storage using hard disk drives. And, you know, we went to extreme lengths. In a rack, normally you have a couple of hundred fans. We built a rack with just three fans. You have sort of ten kilowatts going in to power it. We got it down to a few kilowatts. And I think the thing that you realize, when you do this sort of thing is, as you push the envelope more and more and more, you just hit the limitations. And in fact, you know, we were sort of getting to the stage where you pick up a hard disk drive and thinking I wish I could crack it open and start to change things inside it because I’ve done everything else I could do outside of the hard disk drive.
Ant Rowstron: And I think this is a challenge, in things like Azure, you get components coming in and some of them you can really change, some of them are more fixed. And one of the things I think that is sort of state-of-the-art stuff in picking new media is it’s really about them going to the next level. You know, we talk about this vertical sort of integration, going from the, really, the bottom layer up, whether it’s glass, whether it’s DNA… well, what if you can start from nothing and work up the stack and not have these kind of constraints of what is currently done?
Ant Rowstron: Or boundaries, often artificial boundaries, about what you can and cannot change.
Mark Russinovich: The fascinating thing about some of these things, to me, and this really highlights the power of MSR, is that we’re very much heads–down in Azure, looking really kind of short-term, maybe a little bit longer-term in some cases, but really focused on things that look we’ve got a really nail to be successful. Meanwhile, MSR Cambridge has a lot more flexibility and freedom to go and just explore ideas that sometimes look kind of out there.
Host: Well, and that actually brings up my next topic which is the unique relationship between research and product teams here at Microsoft. And I would love to know, from each of your perspectives, how important this partnership is in the cloud era. According to many of my guests who’ve sat in your chairs, this is one of those historical turning points, like the Xerox PARC people defined computing for the next sixty years back in the 70s. How are research and product working together on this and what do each of you get out of the relationship?
Mark Russinovich: Like I just was talking about, the fascinating, and really exciting, part of MSR is that they can go start to explore these things. Even Ant was exploring glass before we had the opportunity to tell him, this looks, like, ridiculous, stop looking at it. You know, he got it to the point where we saw the potential value and promise of it. But you’re right about this being a turning point. This is a massive shift in technology that’s happening right now, going from the client server era really into the cloud era, and the cloud plus edge era. And we’re defining what that world looks like right now. The systems we define, the services we make, the APIs we create, the technology that we use to power it, all of it is open for disruption for being new. And some of it has to be new just because it doesn’t exist yet. So, like Ant was talking about earlier, the opportunity to have this incredible impact on the future with something like glass, potentially, you know, if this plays out, in two decades everybody will be like, oh, yeah of course glass! And that’s just the way that you store data long-term, is glass. Right now, nobody’s thinking about it. But that’s incredibly exciting. Fun.
Ant Rowstron: Yeah, I mean I think one of the great things about where the cloud is at the moment is that you guys are so innovative, and you are so hungry for innovation, right? I’ve been at Microsoft for twenty years and it’s rarely that I’ve worked with product groups that are, you know, so open. And I think it’s because it’s a competitive world out there. I mean, it’s like you know…
Host: I’ve heard that!
Ant Rowstron: …yeah. And you know, anything you can do to out-innovate makes sense.
Mark Russinovich: Yeah.
Ant Rowstron: So, I think from a research perspective, it’s really great because you get to work with people, they get the real-world experience and they tell you about the…
Ant Rowstron: …things that are working and are not working. And then the flipside is, you know, inside of MSR we get the space to fail. And I think often, when you are in a product group, people want to work on things that ship, and are successful. And so, you know, in MSR, we can really do high-risk stuff. And failure is almost as good as making it work because at least you tried to make it work.
Mark Russinovich: And you learned something.
Ant Rowstron: And you learned something.
Ant Rowstron: And that’s exciting. And I think that’s something that’s really great about this whole, sort of, MSR relationship with the product groups is that we get that space, and yet we get that input that allows us to really make sure that when we do get to the end, if it has worked, we can get it over.
Host: So how often does the phrase, “that’s ridiculous” preface… preface a breakthrough?
Mark Russinovich: It’s probably – yeah, in the history of science and engineering? A lot!
Host: Let’s talk about this new program called the Optics for the Cloud Research Alliance. It’s a kind of big consortium of researchers, academia… MSR is obviously part of it. Why do we need this, and what are its goals?
Ant Rowstron: Within the team in Cambridge, we started to do a lot of stuff looking at you know, broadly, in different ways, optics for the cloud. If you think of it today, optics are used in the cloud. We use it to replace copper wires. We have a fiber optic cable and you use photons rather than electrons. But, you know, the question was, could we do more things? Can we make all–optical networks? Can we build out optical compute? Can we do optical storage? Now, as MSR, we don’t have a lot of people with that kind of experience.
Ant Rowstron: So, one of the great things seemed to do was to put together this Research Alliance, to go out and find universities that we can partner with. We can talk to them, we can tell them about some of the problems we’re having in the cloud, and let them think about the technologies they’re currently working on, how they relate to that. It allows us to have conversations, and it allows us to sort of build a community thinking around how this optics for the cloud could look like. And I mean, one of the interesting things is, you know, people often say, well, why did it start in the UK? And we started it in the UK. And I think, you know, historically, we had companies like British Telecom who were the telephone company in the UK, who invested a lot in optics, particularly in the areas of optical networking and things like that.
Ant Rowstron: So, we had these great places like Southampton University, like UCL, teams full of people with a huge amount of experience of using optics in different ways. And in fact, the Silica stuff, you know, it started at the University of Southampton. You know, they had this breakthrough of how to store data in glass. And we’ve been able to sort of help grow that, help expand that. It’s a very sort of powerful way to partner with these people and to collaborate.
Host: Yeah, so, Mark, does it ever percolate up to you? I mean, this is a research alliance with a bunch of academics and you are over here in Azure, running the joint. Is it opaque, what they are doing, except as it presents itself to you as new technologies?
Mark Russinovich: No, not completely opaque.
Mark Russinovich: I mean, one of the ways that Azure and MSR stay very connected is tight interactions that are constantly ongoing. So, it’s not like we drift apart and then MSR shows up at the door and says, hey, we’re thinking of this, and the project has been going for three years and can you take it and use it? We realized, a long time ago, that we needed to start engaging very early in these things.
Mark Russinovich: So, in fact, in Azure I am the champion for the MSR/Azure relationship. As part of that, I run a bi-yearly summit, effectively, between Microsoft Research and Azure, where Microsoft researchers will come and present on projects that they’ve started on, ideally early enough, that we can start to influence them…
Mark Russinovich: …if they look interesting to Azure. We’ve got one coming up just actually next week. This one we’re actually doing a little bit differently. And at the previous ones, I’ve stood up to the researchers and told them a little bit about what Azure’s challenges are and what our priorities are. At this one, we’re bringing in the heads of the Azure engineering groups to each talk about their own areas so they’ll get a deeper view of what Azure is looking at and this might, hopefully, you know, spur them to say, wait a minute, I’m passionate about this technology, this is a problem that Azure has, maybe we can apply it there and then that will start the formation of a new project.
Ant Rowstron: And of course, some of the leaders from Azure were in MSR…
Mark Russinovich: Yeah, that’s right.
Ant Rowstron: Like Albert Greenberg and David Maltz and people like this. So, there’s quite a cross–pollination of people as well, which help that communication channel going…
Ant Rowstron: …because, you know, you’re talking to someone that, you, you know, ten years ago, you knew because they were in an office down the corridor from you and now they are over in Azure. You don’t just stop communicating because you change…
Ant Rowstron: …you know, it’s good.
Host: Let’s talk about Project Silica, or “Azure on Glass” as it were. Ant, you’ve referred to this as the “journey from metal to glass.” I don’t know if that’s a rock n’ roll thing or what… but, let’s go deep. Um. Tell us all about Project Silica. It’s not incremental, it’s disruptive. It’s brand-new, ground-up, clean slate technologies. Tell us about it, how it came about, how it works and why it’s cool.
Ant Rowstron: So, how did it come about, right? I guess, you know, you’ve got a piece of glass, you fire a femtosecond laser into the piece of glass, and you focus that laser in a particular point in the piece of glass, and what happens is a structure forms, a nanograting. And the structure has this sort of orientation to it, and a depth, and you can control the orientation and the depth. That was sort of the discovery at Southampton that you could fire these in, correct these nanogratings and then control them and therefore encode, potentially, data in them. And you can think of this piece of glass as having layers of data in it. So, it’s a sort of 3-dimensional type of storage. So, you know, we’d been working very hard to work out the number of layers that you can have, how you can actually read the data out from these layers, and how you can control the laser in order to be able to actually form the voxels that you want, with the right properties that you want.
Host: Go in on what a voxel is, because not everyone will know.
Ant Rowstron: So it’s just a little structure. You can think of it as a sort of an upside-down iceberg almost. And it has depth to it and you can control that. But it also has these sort of little lines on it, a little grating on it and you can control the angle at which that grating is formed. And it’s really small. Think of it, you know, submicron. And that’s how you encode data in the glass. And you can make lots and lots of them within the glass. And once you’ve made that change to the glass, that change is going to exist in the glass for hundreds of thousands to millions of years. And quartz glass, it’s a very pure type of glass. And once you’ve made those changes, it doesn’t sort of decay. If you think of things of how disk drives, you get bit rot, you get bits that flip because of things that happen. With this, if it’s there, it stays there. You can boil it in water, you can put it in a microwave, you can put it in an oven. You can do almost anything you want to it and you’re not going to actually destroy the data that’s in it. That’s one of the things that’s really quite exciting about it is that this fact that, you don’t need to ever really check it. If the data’s there when you’ve written it, and you can read it out after you’ve written it, in a hundred years’ time, you’ll be able to read it out again.
Host: All right.
Ant Rowstron: Does that make sense?
Host: Well, to someone it does…
Mark Russinovich: It makes sense to me.
Host: Well, OK, so… Let’s talk a bit about compelling applications for the kind of data that you would want to put here. When I talked to you before, you referred to three kinds of storage: hot, warm and cold. Tell us a little bit about the different needs for storage and how this plays into that.
Ant Rowstron: Sure so, you know, hot data is data that you are accessing all the time. And, you know, if you think of things like email in your inbox, that’s the email that’s arriving today. You’re looking at it, you’re probably looking at it on lots of different devices. That’s the data which is hot. As data gets older, it sort of becomes less hot often, and so it becomes, sort of, warm. So, you can think about that as the emails you, perhaps, received last week or last month. And then you’ve got the stuff that you, perhaps, you know, never look at, your emails from four, five years ago, which you just rarely, if ever, look at. And that’s really cold data. And there’s all sorts of things which end up being cold. And with the glass stuff, we’re really looking at the cold data, the data which basically just sat there and it’s only read when it’s really needed. Examples of this is sort of backups of your photos and things like this.
Host: Are you enabling me to be a hoarder, a digital data hoarder??
Ant Rowstron: Yeah, I mean that’s the idea! I mean, if you think of it at the moment, right, actually you know, one of the challenges is storage costs, right?
Ant Rowstron: So, you only store the data that you can really afford to keep.
Ant Rowstron: Right? As you reduce the cost of storing data, you can store more and more of it because you can afford to store data that you think is cheap. And I have this example which is, you know, probably five years ago, when you took pictures on your phone, you would go through and edit those pictures and decide which ones to keep and which ones to throw away. Today, you probably just don’t bother. You just keep them all.
Host: I’ve got 32,000 pictures in the cloud.
Ant Rowstron: There you go. That’s the benefit of cheap storage.
Host: Or the detriment…
Ant Rowstron: But you know, that’s a lot of this thing about this data that we keep…
Ant Rowstron: …sometimes we don’t know the value of it, when it’s generated. It’s in the future that you suddenly discover, you know, that something was valuable and you just got rid of it because, at that moment in time, you didn’t realize it was going to be valuable.
Host: Right. Right, right, right. Well, Mark, talk about some compelling applications from an Azure perspective. Why would we want phenomenal storage power in this itty-bitty living space?
Mark Russinovich: Well, like Ant was saying, I think there’s lots of applications where you’ve got data that you are generating, that you’ll never know when you’ll need to go back and look at it. So archival data. Data that’s being generated from the ambient environment, or from systems that we use, or from business processes, that can be potentially interesting as we develop new ways to get insights out of data. So, if you take a look at, for example, even the US Census, where they’re constantly going back, there’s demographers that are looking back, historically, to see trends and try to identify patterns in the past that might be applicable in the future. There’s huge amounts of data that they typically have had to throw away just because we can’t store it all.
Mark Russinovich: You know, we can’t process it, even if we could store it. Now, with cloud scale, and these kinds of technologies, you can keep all of that and you don’t know what’s going to be useful or not. In the future at some point, we might come across something, some question or some new way of looking at data that we can go back now and get insights and use out of that previous data. But one of the most exciting areas that we’ve seen immediate interest in is from media and entertainment.
Host: Oh, interesting.
Mark Russinovich: Because if you take a look at these companies, their total IP is the content that they generate.
Mark Russinovich: And so, they want to preserve that. In many cases, they take a piece of content they’ve generated and they use that in multiple ways in the future and they don’t know how they’ll want to use it in the future. So, they keep it forever, because it might be, eventually, valuable. If you take a look at for example movies…
Mark Russinovich: …they’ll store movies, and when a new type of format comes out, like 4K or 8K or Blu-Ray or whatever…
Mark Russinovich: …they’ll re-release the movie in that new media format. Or they’ll take clips from old movies and use them in new productions. So they keep everything. Now, the only way that they trust the storage of this media, because of some of the issues that Ant’s talked about with things like flash and hard disk, is they’ll take it and they’ll put it in analog form on film, even if it was digitally mastered.
Mark Russinovich: And they do that because film actually has better properties, in terms of degradation, than some of these digital media. They’ll take it, put it on film in black and white and then have other strips of films in the color separations.
Mark Russinovich: They’ll roll them up in the canisters, make two copies, and put them in salt mines. And whenever they need them, they’ll go back and read them out.
Host: So wait. That’s state-of-the-art right now, is going back to film.
Mark Russinovich: Yep. And I mean, you can already think about the kind of costs and issues with that.
Host: Well, and...
Ant Rowstron: This is media that lasts for a really long time. It just doesn’t – I mean, if you think of the media we talk about today, how disk drives, tape, flash, you really wouldn’t trust any of them more than about fifteen to twenty years at most, right?
Host: Well, and even film…
Mark Russinovich: Yeah.
Host: I think we may have thought at some point, we need to get film onto something else because it degrades.
Mark Russinovich: Yeah. I mean, so they vacuum–seal these. They put them in these very cold environments with no light and so they do last longer than, you know, film that’s laying around. But still, that’s the only media that they’d actually trust with this kind of long-term data. So when they heard about this, because we talked to some of them, they immediately got excited, because now they can write it once, now they never have to go back and refresh it, and it’s 100% durable, and it’s the cost that we imagine here because the crystal glass is incredibly cheap… Cost is not actually a big factor for them, it’s really the other factors…
Host: So, you could put Gone With the Wind on a piece of glass and then use it as at trivet, if you’d like. No, I’m kidding…
Mark Russinovich: It’s actually funny. Ant’s lab put one of my novels on a piece of glass, Zero Day, so now I’ve got that preserved for millions of years.
Host: Okay. So that actually opens up – that opens up a couple of questions. One being, this has application for any kind of cultural artifacts that we might want to keep: literature, film, music, etc. Can we talk a little bit about how you get to it? I mean, I know I have, like I say, 32,000 photos. Part of my problem isn’t that I have them, it’s that I can’t find what I’m looking for.
Mark Russinovich: Yeah, well that’s a research problem that we haven’t been able to crack yet.
Host: That’s one of those foundational, fundamental questions….
Ant Rowstron: We’ve only just solved the actual storage question.
Mark Russinovich: How you actually find it is…
Host: It’s up to you. You haven’t sorted through your stuff. Well no, seriously, how do you address access?
Ant Rowstron: Well, I think, you know, increasingly, there’s sort of like machine learning technologies that are able to take images and recognize things in them. And so you can do search and things like that over your image set.
Mark Russinovich: Yeah.
Ant Rowstron: So, you’re, show me pictures of, you know, Person X, outside…
Mark Russinovich: Yeah. And that’s actually what they would do with movies for example. They’ll have machine learning algorithms run on the media and say, Character X…
Mark Russinovich: …is at minute 43… then you know, minute 44:33 seconds. And that goes into an index. So, when they way I want clips of this person…
Mark Russinovich: …they can go quickly find them.
Ant Rowstron: Yeah. Because if you think of it, a lot of media is historical media, right? And so it’s like if some event happens…
Mark Russinovich: Yeah.
Ant Rowstron: …somebody dies or something happens, and you want to find things relevant to that and um…
Mark Russinovich: Yeah.
Ant Rowstron: …you don’t want to pull back the whole archive just to find…
Ant Rowstron: …you know, two hours of video out of a thousand hours or something, you know.
Host: What about, like, news organizations? Is that another…
Mark Russinovich: Uh, absolutely. I mean what Ant was just referring to…
Mark Russinovich: …historical footage.
Mark Russinovich: You know, when a politician stands up and says, I never said that…
Host: Oh, yes you did!
Mark Russinovich: …they can just quickly go back to the index and very quickly say, yeah, you actually said it right here.
Host: Listen, this whole thing is a Black Mirror episode anyway… because it’s like that, you know, rewind thing… Well, we’re quickly moving from a terabyte and petabyte era to a… exabyte, zettabyte yottabyte… Let’s just say that the cloud is a data–beast and it needs to be taken care of. So not only do we need storage, we need other aspects to make sure that things work. And we’ve just alluded to, you know, access. What about compute and networking? What do we need there and what’s being worked on?
Mark Russinovich: Well, so there, I think a few things. If you take a look at processing data, and doing it very efficiently, you want to get the data to the compute very quickly. So, looking at reducing that latency, whether that means putting the compute right next to the data, or having very low latency connections between the data and the compute. And then, large scale, parallelized compute, because, oftentimes, being able to efficiently process data requires more than just one computer, one CPU, requires a fleet of them. So being able to have very low latency, high bandwidth interconnections between the computers themselves. Large AI model training is an example…
Mark Russinovich: …of that kind of workload that requires this kind of very low latency, high bandwidth connections between the different computers and lots of the larger machine learning models don’t fit on a single system or even a dozen. They require hundreds or thousands of them. For example, our Open AI announcement that we made is building an AI supercomputer with them that’s going to have these characteristics of having large-scale data, plus compute, plus networking, all working together.
Ant Rowstron: You know, we’re doing some other work on optical networking in the lab in Cambridge in Sirius and that’s looking at how to build out these all–optical networks for these kind of things which is all about trying to reduce the latency and make the networks, effectively, higher bandwidth, lower latency, which is…
Mark Russinovich: And yeah, lower power…
Ant Rowstron: Lower power…
Mark Russinovich: Which is lower cost…
Ant Rowstron: Lower cost, that’s right.
Mark Russinovich: And higher density, too, which is also lower cost.
Ant Rowstron: Yeah.
Host: So, you referred to that as Sirius.
Ant Rowstron: Yes, Sirius, Project Sirius is another one of the projects.
Host: Not like are you serious, but…
Ant Rowstron: No. Sirius.
Host: Sirius Black…
Mark Russinovich: Yeah. I think somebody from Azure probably said, “Are you serious?” when they first…
Host: “That’s ridiculous!”
Host: Well, it’s about now that I like to have a short conversation on the spectrum of risks associated with people’s work. I call this the “what keeps you up at night” section. What could possibly go wrong with anything that the two of you are working on collectively in this area?
Ant Rowstron: So, one of the things that keeps me up at night with, particularly with our storage work is, if you have a storage media, you expect it to store things. And we’re making decisions today and we’re making design choices today that potentially, in fifty or a hundred years’ time, we’ll want to make sure that they can still get the data out. Even in a thousand years’ time or ten thousand years’ time. I often think, are the decisions we’re making today the right decisions to enable that to happen? And, you know, if people store really important things like movies or their life’s photos in it, you want to make sure that it’s really there when you say it will be there and that keeps me awake a lot of the time. It’s a big responsibility, in my view.
Host: How about you, Mark?
Mark Russinovich: I have a high confidence in Ant’s team and the partnership we’ve got going with them gives me high confidence that we’re actually going to get there on this one.
Mark Russinovich: I think what keeps me up at night in this space like this in general is somebody else coming up with something that disrupts the space and leaves us behind. You mentioned it’s a competitive environment…
Mark Russinovich: …and so, this is another reason why it’s so important for us to be exploring these different areas that initially look like might have a little chance of success. But if they do, would massively transform things and this is one of those examples.
Host: So, Mark, you actually have a unique story about how you came to Microsoft, but people can find that story in Wired Magazine! So, I want to just ask Ant here about his path to Microsoft, Microsoft Research, and if there’s anything, Ant, that you have had happen in your life that’s informed or influenced your career that landed you here doing the work you are doing?
Ant Rowstron: So, I moved to Microsoft about twenty years ago. And before that I was at the University of Cambridge as a research associate and, you know, one of the really interesting things was, I worked with a guy called Andy Hopper there. And I finished my PhD and went to work with him. And as many academics do, as many young PhDs students becoming academics do, I just sort of worked on the next thing that was following after my thesis. And he came to my door and said to me, Ant, I want you to build me a robot football team to compete in the 1998 RoboCup in Paris, which is when the world cup in Paris was. I sort of ignored it. Carried on! Um. And a few weeks later he came back, and he said, you know, Ant, I really want you to build me a robot football team. So, I built a robot football team and they play on a table tennis table and there’s like five robots and they were all autonomous. So, you sort of just pressed go, and they had to play football.
Ant Rowstron: I still have the robot on my shelf in the office. But you know, the one thing that it did for me was that it really just taught me I could dive in. I mean, I used to work on distributed systems and programming languages, and where they met, and now I was ending up building robots and working with a lot of other people building robots and building systems like that. You know, throughout my career at Microsoft, that’s given me the confidence to just dive in and do other things, and I think that’s a moment for me that’s had a big influence on my career.
Host: As we close, there are 7 billion people on the planet, 4 billion of them are online, and computing is moving to the cloud. Ant and Mark, any parting shots? What would you like to say to our listeners in the form of advice or inspiration, wisdom or warning or just plain predicting the future? What’s on the horizon from each of your perspectives?
Mark Russinovich: I don’t think I’d give any warning. I’d give encouragement. Again, it’s that incredible opportunity to really shape the future. And it’s not just moving to the cloud. I think the important aspect to this is the edge as well, because computing is getting more organized on the edge and you’ll see more of it on the edge than what we’ve seen up to now, which is why Microsoft’s focused on intelligent cloud/intelligent edge. So that’s a huge part of our charter in Azure too, is looking at that space as well, and there’s a bunch of problems in new technologies that need to be developed there.
Host: Ant, how about you?
Ant Rowstron: As I said earlier, about the technologies and existing technologies getting to the end of their lifetimes, or at least getting to the stage where it’s not so easy just to get the next real cycle out of them, I think it’s just a huge opportunity to change things. And I think, as a researcher, anywhere where there’s a sort of huge opportunity for things to be done differently is just really exciting.
Host: This has been incredibly fun. Thank you, Ant Rowstron and Mark Russinovich, for joining us in the booth today.
Ant Rowstron: Thank you.
Mark Russinovich: Thanks.
To learn more about Dr. Ant Rowstron and Mark Russinovich, and how Microsoft is working to meet all your digital storage needs, visit Microsoft.com/research