Q: Is reactionless propulsion possible?

Physicist: In a word: no.

A reactionless drive is basically a closed box with the ability to just start moving, on its own, without touching or exuding anything.  The classic sci-fi tropes of silent flying cars or hovering UFOs are examples of reactionless drives.

The problem at the heart of all reactionless drives is that they come into conflict with Newton’s famous law “for every action there is an equal and opposite reaction” (hence the name).  To walk in one direction, you push against the ground in the opposite direction.  To paddle your canoe forward, you push water backward.  The stuff you push backward so you can move forward is called the “reaction mass”.

In order to move stuff forward, you need to move other stuff backward.

This is a universal law, so unfortunately it applies in space.  If you want to move in space (where there’s nothing else around) you need to bring your reaction mass with you.  This is why we use rockets instead of propellers or paddles in space; a rocket is a mass-throwing machine.

But mass is at a premium in space.  It presently costs in the neighborhood of $2000/kg to send stuff to low Earth orbit (a huge improvement over just a few years ago).  So, the lighter your rocket, the better.  Typically, a huge fraction of a rocket’s mass is fuel/reaction mass, so the best way to make spaceflight cheaper and more feasible is to cut down on the amount of reaction mass.  The only way to do that at present is to use that mass more efficiently.  If you can throw mass twice as fast, you’ll push your rocket twice as hard.  Traditionally, that’s done by burning fuel hotter and under higher pressure so it comes shooting out faster.

In modern rockets the exhaust is moving on the order of 2-3 km per second.  However, your reaction mass doesn’t need to be fuel; it can be anything.  Ion drives fire ionized gas out of their business end at up to 50 km per second, meaning they can afford to carry far less reaction mass.  Spacecraft with ion drives are doubly advantaged: not only are they throwing their reaction mass much faster, but since they carry less of it, they can be smaller and easier to push.

The drawback is that ion drives dole out that reaction mass a tiny bit at a time.  The most powerful ion drives produce about 0.9 ounces of force.  A typical budgie (a small, excitable bird) weighs about 1.2 ounces and, since they can lift themselves, budgies can generate more force than any ion drive presently in production.

Compared to rockets, ion drives pack a greater punch for a given amount of reaction mass.  However, they deliver that punch over a very long time and with less force than a budgie.
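
To put numbers to that comparison: thrust is just (mass thrown per second) × (speed it’s thrown at).  Here’s a minimal sketch in Python; the exhaust speeds are the ones quoted above, but the mass flow rates are made-up, merely representative values.

```python
def thrust_newtons(mass_flow_kg_per_s, exhaust_speed_m_per_s):
    # Newton's third law in number form: the forward push equals
    # the momentum thrown backward each second.
    return mass_flow_kg_per_s * exhaust_speed_m_per_s

# A big chemical engine: enormous mass flow at ~3 km/s (flow rate assumed)
print(thrust_newtons(500, 3_000))    # 1,500,000 N

# An ion drive: a wisp of gas at ~50 km/s (flow rate assumed)
print(thrust_newtons(5e-6, 50_000))  # 0.25 N, about 0.9 ounces of force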

Given the limitations and inefficiencies, wouldn’t it be nice to have a new kind of drive that didn’t involve reaction mass at all?  You’d never have to worry about running out of reaction mass; all you’d need is a power supply, and you could fly around for as long as you want.

That’s not to say that propellantless propulsion isn’t possible.  There are ways to move without carrying reaction mass with you.  You can use light as your exhaust (a “photon drive”), but you’ll notice that a flashlight or laser pointer doesn’t have much of a kick.  And you can slingshot around planets, but then the planet is your reaction mass.

The problem with reactionless drives, fundamentally, is that Newton’s third law has no (known) exceptions.  It is one of the most bedrock, absolute rules in any science and a keystone in our understanding of the universe.  On those rare occasions when someone thought they had found an exception, it always turned out to be an issue with failing to take something into account.  For example, when a neutron decays into a proton and electron, the new pair of particles don’t fly apart in exactly opposite directions.  Instead, the pair have a net momentum that the original neutron did not.

When a stationary neutron (gray) decays into a proton (red) and electron (blue), the new pair flies apart, but always favors one direction.  Newton’s laws imply that there must be a third particle moving in the other direction to balance the other two.

The implication (according to Newton’s law) is that there must be another particle to balance things out.  And that’s exactly the case.  Although the “extra particle that’s really hard to detect” theory was first proposed in 1930, it wasn’t until 1956 that neutrinos were finally detected and verified to exist.  The imbalanced momentum, a violation of Newton’s laws, came down to a missing particle.  Today neutrinos are one of the four ways we can learn about space, along with light, gravitational waves, and go-there-yourself.

There are plenty of ideas floating around about how to create reactionless drives, such as the Woodward Effect or the Alcubierre warp drive.  But in no case do these ideas use established science.  The Woodward effect depends on Mach’s principle (that somehow inertia is caused by all the other matter in the universe), and reads like a pamphlet a stranger on the street might give you, while the Alcubierre drive needs lots of negative energy, which flat-out isn’t a thing.

Science is all about learning things we don’t know and trying to prove our own theories wrong.  While scientific discovery is certainly awe inspiring, it is also the exact opposite of wishful thinking.  That said, good science means keeping an open mind much longer than any reasonable person would be willing to.  In the ultimate battle between theoretical and experimental physics, the experimentalists always win.  If someone ever manages to create a self-moving, reactionless drive, then all the theories about why that’s impossible go out the window.  But as of now, those theories (standard physics) are holding firm.  We can expect that for the rest of forever, all spacecraft will have a tail of exhaust behind them.


Q: How can I set up a random gift exchange that’s different from year to year?

The original question was: I’ve got a large family and we do a yearly gift exchange one person to one person. And I’d like to make a algorithm or something to do random selection without repeating for some time. And to be able to take old data and put it in to avoid repeats. I’m pretty good at math I’m 29 and my trade is being a machinist so I’ve got some understanding of how things kinda work.


Physicist: A good method should be trivially easy to keep track of from year to year, work quickly and simply, never assign anyone to themselves, make sure that everyone gives to everyone else exactly once, and be unobtrusive enough that it doesn’t bother anybody.  Luckily, there are a few options.  Here’s one of the simplest.

Give each of the N people involved a number from 1 to N.  The only things you’ll need to keep track of from year to year are everyone’s number and an index number, k.  Regardless of who the best gift-giver is, there are no best numbers and no way to game the system, and since no one wants to keep track of a list from one year to the next, you should choose something simple like alphabetical numbering (Aaron Aardvark would be #1 and Zylah von Zyzzyx would be #N).

Draw N dots in a circle.  Then, starting with dot #1, draw an arrow to dot #(1+k), from there to dot #(1+2k), and so on.  There’s nothing special about any particular dot; since they’re arranged in a circle #1 has just as much a claim to be first as #3 or #N.  When you finally get back to dot #1, you’ll have drawn a star.  Each different value of k, from 1 to N-1, will produce a different star and a different gift-giving pattern.  For example, if N=8 and k=3, then you get a star that describes the pattern {1→4→7→2→5→8→3→6→1} (that is “1 gives to 4 who gives to 7 …”).

When N is prime, or more generally when N and k have no common factors, you’ll hit every dot with a single star.  Otherwise, you have to draw a few different stars (specifically, “the greatest common divisor of k and N” different stars).  For example, if N=8 and k=2, then you need two stars: {1→3→5→7→1} and {2→4→6→8→2}.

Given N points, you can create a star by counting k points and drawing a connection.  This works great if N is a prime (bottom), since you’ll always hit every point, but when N isn’t prime you’ll often need to create several stars (top).

That’s why drawing is a good way of doing this math: it’s easy to see when your star is weird-shaped (your k changed halfway through) and really easy to see when you’ve missed some of the dots.
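
If you’d rather compute than draw, the bookkeeping is a one-liner.  Here’s a minimal sketch (the function name is just for illustration):

```python
from math import gcd

def star_assignments(n, k):
    """Who gives to whom for n people and step size k (1 <= k <= n-1).
    Person i gives to the person k steps around the circle."""
    return {i: (i - 1 + k) % n + 1 for i in range(1, n + 1)}

print(star_assignments(8, 3))  # {1: 4, 2: 5, ...}: the {1→4→7→2→5→8→3→6→1} star
print(gcd(2, 8))               # 2: the number of separate stars when N=8, k=2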

The “star method” gives you N-1 years of “cyclic permutations” (“cyclic” because when you get to the end you just go back to the beginning).  However, for large values of N that’s only a drop in the permutation sea.  Were you so determined, you could cyclically permute N things in (N-1)! ways.

However!  With a family of 6 you’d need 5! = 1x2x3x4x5 = 120 years to get through every permutation.  More than time enough for any family to gain or lose members, or for some helpful soul to start questioning the point.  Moreover!  Each of those permutations is similar enough to the others that you’ll start to feel as though you’re doing a lot of repeating.  For example: {1→2→3→4→5→6→1}, {1→2→3→4→6→5→1}, {1→2→3→5→6→4→1}, …

For those obsessive, immortal gift-givers who want to hit every permutation with the least year-to-year change, just to fight off boredom for another thousand years, there’s Heap’s algorithm.  For the rest of us, drawing stars is more than good enough.
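
For the completists, here’s a sketch of Heap’s algorithm.  It churns out every ordering of a list while changing only one pair at a time; to enumerate the (N-1)! cyclic patterns you’d hold person #1 fixed and permute everyone else.

```python
def heaps(items):
    """Heap's algorithm: yield every permutation of items, each
    differing from the previous one by a single swap."""
    a, n = list(items), len(items)
    c = [0] * n
    yield tuple(a)
    i = 0
    while i < n:
        if c[i] < i:
            if i % 2 == 0:
                a[0], a[i] = a[i], a[0]
            else:
                a[c[i]], a[i] = a[i], a[c[i]]
            yield tuple(a)
            c[i] += 1
            i = 0
        else:
            c[i] = 0
            i += 1

for rest in heaps([2, 3, 4]):   # person #1 stays put, the rest are permuted
    print((1,) + rest)          # each line is one giving cycle: 1 → ... → back to 1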


Q: How does “1+2+3+4+5+… = -1/12” make any sense?

Physicist: When wandering across the vast plains of the internet, you may have come across this bizarre fact, that 1+2+3+4+\ldots=-\frac{1}{12}, and immediately wondered: Why isn’t it infinity?  How can it be a fraction?  Wait… it’s negative?

An unfortunate conclusion may be to say “math is a painful, incomprehensible mystery and I’m not smart enough to get it”.  But rest assured, if you think that 1+2+3+4+\ldots=\infty, then you’re right.  Don’t let anyone tell you different.  The -\frac{1}{12} thing falls out of an obscure, if-it-applies-to-you-then-you-already-know-about-it, branch of mathematics called number theory.

Number theorists get very excited about the “Riemann Zeta Function”, ζ(s), which is equal to \zeta(s)=\sum_{n=1}^\infty\left(\frac{1}{n}\right)^s=1+\frac{1}{2^s}+\frac{1}{3^s}+\frac{1}{4^s}+\ldots whenever this summation is equal to a number.  If you plug s=-1 into ζ(s), then it seems like you should get \zeta(-1)=1+2+3+4+\ldots, but in this case the summation isn’t a number (it’s infinity) so it’s not equal to ζ(s).  The entire -1/12 thing comes down to the fact that \zeta(-1)=-\frac{1}{12}, however (and this is the crux of the issue), when you plug s=-1 into ζ(s), you aren’t using that sum.  ζ(s) is a function in its own right, which happens to be equal to 1+\frac{1}{2^s}+\frac{1}{3^s}+\frac{1}{4^s}+\ldots for s>1, but continues to exist and make sense long after the summation stops working.

The bigger s is, the smaller each term, \frac{1}{n^s}, and ζ(s) will be.  As a general rule, if s>1, then ζ(s) is an actual number (not infinity).  When s=1, \zeta(1)=1+\frac{1}{2}+\frac{1}{3}+\frac{1}{4}+\ldots=\infty.  It is absolutely reasonable to expect that for s<1, ζ(s) will continue to be infinite.  After all, \zeta(1)=\infty and each term in the sum only gets bigger for lower values of s.  But that’s not quite how the Riemann Zeta Function is defined.  ζ(s) is defined as \zeta(s)=\sum_{n=1}^\infty\left(\frac{1}{n}\right)^s when s>1 and as the “analytic continuation” of that sum otherwise.


You know what this bridge would do if it kept going.  “Analytic continuation” is essentially the same idea; take a function that stops (perhaps unnecessarily) and continue it in exactly the way you’d expect.

The analytic continuation of a function is unique, so nailing down ζ(s) for s>1 is all you need to continue it out into the complex plane.

Complex numbers take the form “A+Bi” (where i^2=-1).  The only thing about complex numbers you’ll need to know here is that complex numbers are pairs of real numbers (regular numbers), A and B.  Being a pair of numbers means that complex numbers form the “complex plane“, which is broader than the “real number line“.  A is called the “real part”, often written A=Re[A+Bi], and B is the “imaginary part”, B=Im[A+Bi].

That blow up at s=1 seems insurmountable on the real number line, but in the complex plane you can just walk around it to see what’s on the other side.

Left: ζ(s) for values of s>1 on the real number line. Right: The same red function surrounded by its “analytic continuation” into the rest of the complex plane.  Notice that, except for s=1, ζ(s) is completely smooth and well behaved.

\zeta(s)=1+\frac{1}{2^s}+\frac{1}{3^s}+\frac{1}{4^s}+\ldots defines a nice, smooth function for Re[s]>1.  When you extend ζ(s) into the complex plane this summation definition rapidly stops making sense, because the sum “diverges” when s≤1.  But there are two ways to diverge: a sum can either blow up to infinity or just never get around to being a number.  For example, 1-1+1-1+1-1+… doesn’t blow up to infinity, but it also never settles down to a single number (it bounces between one and zero).  ζ(s) blows up at s=1, but remains finite everywhere else.  If you were to walk out into the complex plane you’d find that right up until the line where Re[s]=1, ζ(s) is perfectly well-behaved.  Looking only at the values of ζ(s), you’d see no reason not to keep going; it’s just that the \zeta(s)=1+\frac{1}{2^s}+\frac{1}{3^s}+\frac{1}{4^s}+\ldots formulation suddenly stops working for Re[s]≤1.

But that’s no problem for a mathematician (see the answer gravy below).  You can follow ζ(s) from the large real numbers (where the summation definition makes sense), around the blow up at s=1, to s=-1 where you find a completely mundane value.  It’s -\frac{1}{12}.  No big deal.

So the 1+2+3+\ldots=-\frac{1}{12} thing is entirely about math enthusiasts being so (justifiably) excited about ζ(s) that they misapply it, and has nothing to do with what 1+2+3+… actually equals.

This shows up outside of number theory.  In order to model particle interactions correctly, it’s important to take into account every possible way for that interaction to unfold.  This means taking an infinite sum, which often goes well (produces a finite result), but sometimes doesn’t.  It turns out that physical laws really like functions that make sense in the complex plane.  So when “1+2+3+…” started showing up in calculations of certain particle interactions, physicists turned to the Riemann Zeta function and found that using -1/12 actually turned out to be the right thing to do (in physics “the right thing to do” means “the thing that generates results that precisely agree with experiment”).

A less technical shortcut (or at least a shortcut with the technicalities swept under the rug) for why the summation is -1/12 instead of something else can be found here.  For exactly why ζ(-1)=-1/12, see below.


Answer Gravy: Figuring out that ζ(-1)=-1/12 takes a bit of work.  You have to find an analytic continuation that covers s=-1, and then actually evaluate it.  Somewhat surprisingly, this is something you can do by hand.

Often, analytically continuing a function comes down to re-writing it in such a way that its “poles” (the locations where it blows up) don’t screw things up more than they absolutely have to.  For example, the function f(z)=\sum_{n=0}^\infty z^n only makes sense for -1<z<1, because it blows up at z=1 and doesn’t converge at z=-1.

f(z) can be explicitly written without a summation, unlike ζ(s), which gives us some insight into why it stops making sense for |z|≥1.  It just so happens that for |z|<1, f(z)=\frac{1}{1-z}.  This clearly blows up at z=1, but is otherwise perfectly well behaved; the issues at z=-1 and beyond just vanish.  f(z) and \frac{1}{1-z} are the same in every way inside of -1<z<1.  The only difference is that \frac{1}{1-z} doesn’t abruptly stop, but instead continues to make sense over a bigger domain.  \frac{1}{1-z} is the analytic continuation of f(z)=\sum_{n=0}^\infty z^n to the region outside of -1<z<1.

Finding an analytic continuation for ζ(s) is a lot trickier, because there’s no cute way to write it without using an infinite summation (or product), but the basic idea is the same.  We’re going to do this in two steps: first turning ζ(s) into an alternating sum that converges for s>0 (except s=1), then turning that into a form that converges everywhere (except s=1).

For seemingly no reason, multiply ζ(s) by \left(1-2^{1-s}\right):

\begin{array}{ll}    &\left(1-2^{1-s}\right)\zeta(s)\\[2mm]  =&\left(1-2^{1-s}\right)\sum_{n=1}^\infty\frac{1}{n^s}\\[2mm]  =&\sum_{n=1}^\infty\frac{1}{n^s}-2^{1-s}\sum_{n=1}^\infty\frac{1}{n^s}\\[2mm]  =&\sum_{n=1}^\infty\frac{1}{n^s}-2\sum_{n=1}^\infty\frac{1}{(2n)^s}\\[2mm]  =&\left(\frac{1}{1^s}+\frac{1}{2^s}+\frac{1}{3^s}+\frac{1}{4^s}+\ldots\right)-2\left(\frac{1}{2^s}+\frac{1}{4^s}+\frac{1}{6^s}+\frac{1}{8^s}+\ldots\right)\\[2mm]  =&\frac{1}{1^s}-\frac{1}{2^s}+\frac{1}{3^s}-\frac{1}{4^s}+\ldots\\[2mm]  =&\sum_{n=1}^\infty\frac{(-1)^{n-1}}{n^s}\\[2mm]  =&\sum_{n=0}^\infty\frac{(-1)^{n}}{(n+1)^s}  \end{array}

So we’ve got a new version of the Zeta function, \zeta(s)=\frac{1}{1-2^{1-s}}\sum_{n=0}^\infty\frac{(-1)^{n}}{(n+1)^s}, that is an analytic continuation because this new sum converges in the same region the original form did (s>1), plus a little more (0<s≤1).  Notice that while the summation no longer blows up at s=1, \frac{1}{1-2^{1-s}} does.  Analytic continuation won’t get rid of poles, but it can express them differently.
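
This alternating form is concrete enough to check numerically.  A quick sketch (the function name is just for illustration): it recovers ζ(2)=π²/6, and happily produces values like ζ(1/2), where the original sum is hopeless.

```python
import math

def zeta_alternating(s, terms=200_000):
    """zeta(s) for s > 0, s != 1, via the alternating series:
    zeta(s) = [sum of (-1)^n / (n+1)^s] / (1 - 2^(1-s))."""
    eta = sum((-1)**n / (n + 1)**s for n in range(terms))
    return eta / (1 - 2**(1 - s))

print(zeta_alternating(2.0))   # ~1.6449...
print(math.pi**2 / 6)          # 1.6449..., for comparison
print(zeta_alternating(0.5))   # ~-1.46, even though 1 + 1/sqrt(2) + ... diverges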

There’s a clever old trick for shoehorning a summation into converging: Euler summation.  Euler (who realizes everything) realized that \sum_{n=k}^\infty {n\choose k}\frac{y^{k+1}}{(1+y)^{n+1}}=1 for any y.  This is not obvious.  Being equal to one means that you can pop this into the middle of anything.  If that thing happens to be another sum, it can be used to make that sum “more convergent” for some values of y.  Take any sum, \sum_{k=0}^\infty A_k, insert Euler’s sum, and swap the order of summation:

\begin{array}{rcl}  \sum_{k=0}^\infty A_k&=&\sum_{k=0}^\infty \left(\sum_{n=k}^\infty {n\choose k}\frac{y^{k+1}}{(1+y)^{n+1}}\right)A_k\\[2mm]  &=&\sum_{k=0}^\infty \sum_{n=k}^\infty {n\choose k}\frac{y^{k+1}}{(1+y)^{n+1}}A_k\\[2mm]  &=&\sum_{n=0}^\infty \sum_{k=0}^n {n\choose k}\frac{y^{k+1}}{(1+y)^{n+1}}A_k\\[2mm]  &=&\sum_{n=0}^\infty \frac{1}{(1+y)^{n+1}}\sum_{k=0}^n {n\choose k}y^{k+1}A_k\\[2mm]  \end{array}

If the original sum converges, then this will converge to the same thing, but it may also converge even when the original sum doesn’t.  That’s exactly what you’re looking for when you want to create an analytic continuation; it agrees with the original function, but continues to work over a wider domain.

This looks like a total mess, but it’s stunningly useful.  If we use Euler summation with y=1, we create a summation that analytically continues the Zeta function to the entire complex plane: a “globally convergent form”.  Rather than a definition that only works sometimes (but is easy to understand), we get a definition that works everywhere (but looks like a horror show).

\begin{array}{rcl}  \zeta(s)&=&\frac{1}{1-2^{1-s}}\sum_{k=0}^\infty\frac{(-1)^{k}}{(k+1)^s}\\[2mm]  &=&\frac{1}{1-2^{1-s}}\sum_{n=0}^\infty\frac{1}{(1+y)^{n+1}}\sum_{k=0}^n{n\choose k}y^{k+1}\frac{(-1)^{k}}{(k+1)^s}\\[2mm]  &=&\frac{1}{1-2^{1-s}}\sum_{n=0}^\infty\frac{1}{(1+1)^{n+1}}\sum_{k=0}^n{n\choose k}1^{k+1}\frac{(-1)^{k}}{(k+1)^s}\\[2mm]  &=&\frac{1}{1-2^{1-s}}\sum_{n=0}^\infty\frac{1}{2^{n+1}}\sum_{k=0}^n{n\choose k}\frac{(-1)^{k}}{(k+1)^s}\\[2mm]  \end{array}

This is one of those great examples of the field of mathematics being too big for every discovery to be noticed.  This formulation of ζ(s) was discovered in the 1930s, forgotten for 60 years, and then found in an old book.

For most values of s, this globally convergent form isn’t particularly useful for us “calculate it by hand” folk, because it still has an infinite sum (and adding an infinite number of terms takes a while).  Very fortunately, there’s another cute trick we can use here.  When n>d, \sum _{k=0}^n{n \choose k}(-1)^{k}k^d=0.  This means that for negative integer values of s, that infinite sum suddenly becomes finite because all but a handful of terms are zero.

So finally, we plug s=-1 into ζ(s):

\begin{array}{rcl}  \zeta(-1)&=&\frac{1}{1-2^{2}}\sum_{n=0}^\infty\frac{1}{2^{n+1}}\sum_{k=0}^n{n\choose k}(-1)^{k}(k+1)\\[2mm]  &=&-\frac{1}{3}\sum_{n=0}^1\frac{1}{2^{n+1}}\sum_{k=0}^n{n\choose k}(-1)^{k}(k+1)\\[2mm]  &=&-\frac{1}{3}\cdot\frac{1}{2^{0+1}}{0\choose 0}(-1)^{0}(0+1)-\frac{1}{3}\cdot\frac{1}{2^{1+1}}{1\choose 0}(-1)^{0}(0+1)-\frac{1}{3}\cdot\frac{1}{2^{1+1}}{1\choose 1}(-1)^{1}(1+1)\\[2mm]  &=&-\frac{1}{3}\cdot\frac{1}{2}\cdot1\cdot1\cdot1-\frac{1}{3}\cdot\frac{1}{4}\cdot1\cdot1\cdot1-\frac{1}{3}\cdot\frac{1}{4}\cdot1\cdot(-1)\cdot2\\[2mm]  &=&-\frac{1}{6}-\frac{1}{12}+\frac{1}{6}\\[2mm]  &=&-\frac{1}{12}  \end{array}
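
If doing that by hand seems error-prone, the globally convergent form is also short enough to evaluate by machine.  A sanity-check sketch:

```python
from math import comb

def zeta_hasse(s, n_max=30):
    """The globally convergent form derived above:
    zeta(s) = 1/(1-2^(1-s)) * sum_n 2^-(n+1) * sum_k C(n,k) (-1)^k (k+1)^-s."""
    total = 0.0
    for n in range(n_max + 1):
        inner = sum(comb(n, k) * (-1)**k * (k + 1)**(-s) for k in range(n + 1))
        total += inner / 2**(n + 1)
    return total / (1 - 2**(1 - s))

print(zeta_hasse(-1))  # -0.08333... = -1/12, matching the hand computation
print(zeta_hasse(2))   # ~1.6449 = pi^2/6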

Keen-eyed readers will note that this looks nothing like 1+2+3+… and indeed, it’s not.

(Update: 11/24/17)

A commenter pointed out that it’s a pain to find a proof for why Euler’s sum works.  Basically, this comes down to showing that 1=y^{n+1}\sum_{k=n}^\infty{k\choose n}\frac{1}{(1+y)^{k+1}}.  There are a couple ways to do that, but summation by parts is a good place to start:

\sum_{k=n}^Mf_kg_k=f_n\sum_{k=n}^Mg_k+\sum_{k=n}^{M-1}\left(f_{k+1}-f_k\right)\sum_{j=k+1}^M g_j

You can prove this by counting how often each term, f_rg_u, shows up on each side.  Knowing about geometric series and that {a\choose b-1}+{a\choose b}={a+1\choose b} is all we need to unravel this sum.

\begin{array}{rl}  &\sum_{k=n}^\infty{k\choose n}\frac{1}{(1+y)^{k+1}}\\[2mm]  =&{n\choose n}\sum_{k=n}^\infty\frac{1}{(1+y)^{k+1}}+\sum_{k=n}^\infty\left({k+1\choose n}-{k\choose n}\right)\sum_{j=k+1}^\infty \frac{1}{(1+y)^{j+1}}\\[2mm]  =&\sum_{k=n}^\infty\frac{1}{(1+y)^{k+1}}+\sum_{k=n}^\infty{k\choose n-1}\sum_{j=k+1}^\infty \frac{1}{(1+y)^{j+1}}\\[2mm]  =&\frac{\frac{1}{(1+y)^{n+1}}}{1-\frac{1}{(1+y)}}+\sum_{k=n}^\infty{k\choose n-1}\frac{\frac{1}{(1+y)^{k+2}}}{1-\frac{1}{(1+y)}}\\[2mm]  =&\frac{1}{y(1+y)^n}+\sum_{k=n}^\infty{k\choose n-1}\frac{1}{y(1+y)^{k+1}}\\[2mm]  =&\sum_{k=n-1}^\infty{k\choose n-1}\frac{1}{y(1+y)^{k+1}}\\[2mm]  =&\cdots\\[2mm]  =&\sum_{k=0}^\infty{k\choose 0}\frac{1}{y^n(1+y)^{k+1}}\\[2mm]  =&\frac{1}{y^n}\sum_{k=0}^\infty\frac{1}{(1+y)^{k+1}}\\[2mm]  =&\frac{1}{y^n}\frac{\frac{1}{(1+y)}}{1-\frac{1}{(1+y)}}\\[2mm]  =&\frac{1}{y^{n+1}}  \end{array}

That “…” step means “repeat n times”.  It’s worth mentioning that this only works for |1+y|>1 (otherwise the infinite sums diverge).  Euler summations change how a summation is written, and can accelerate convergence (which is really useful), but if the original sum converges, then the new sum will converge to the same thing.


There’s something new under the Sun!

Physicist: Good news!

On October 19th, 2017, for the first time in history, we detected an object inside of our solar system that originated somewhere else.  It’s a rock the size of a stadium that passed inside of Mercury’s orbit and is presently on its way back out.

This is basically the plot of Arthur C. Clarke’s seminal novel “Rendezvous With Rama” (sans aliens), which is about a thing from space passing really close to the Sun on a hyperbolic orbit before leaving the solar system.  The International Astronomical Union has incorrectly chosen the name “ʻOumuamua” for our first known interstellar visitor, rather than the obvious, pronounceable, and generally superior name “Rama“.  A rose is a rose is a rose, right up until the IAU gives it a weird name.

That dot in the middle is not from around here.  The background stars are streaks because this is a long exposure by a telescope tracking with Rama.  Everything we know comes from observations like this, so while we can determine speed and position, we have to estimate size and mass.

We can tell that Rama is new to the neighborhood because it is traveling far too fast to be gravitationally bound to the Sun.  When it’s in interstellar space, Rama travels at 26 km/s (with respect to the Sun), which is about how fast other stars in our stellar neighborhood are moving.  As it fell toward the inner solar system it gained speed until, at perihelion, it was only a quarter as far from the Sun as the Earth is and was screaming along at nearly 90 km/s.  For comparison, once Voyager 1 has shaken the Sun’s gravitational pull, it will be traveling at about 13 km/s through interstellar space.

The path of Rama (or ʻOumuamua if you insist), the first observed interstellar object to pass through the solar system.

We know that Rama is rocky because it’s not producing a tail.  Anything with lots of water ice boils when it gets close to the Sun.  Rama is distinctly tail-free, which means that it’s an old, cold rock that’s having its first and last Summer for millions or billions of years.

Presumably, Rama is just one of many interstellar objects to pass through our solar system.  We’ve been capable of detecting such objects for only a few decades and this one came remarkably close to us (a few dozen times the Earth-Moon distance, which counts as “close”).  If it hadn’t been so nearby, we probably never would have noticed it.  So if interstellar visitors are rare, then we are seriously lucky.

It has long been assumed that objects such as Rama should exist.  During the early formation of a star system the new planets and moons follow chaotic, often destructive orbits.  Not only do the young planetoids slam into each other, but near misses can fling asteroids, moons, and even planets out of a star system entirely.  As recently as 1982, we witnessed Jupiter rudely chucking a comet into interstellar space.  The main reason things in a mature solar system (such as our own) follow such nice, clean orbits is that almost everything that didn’t stay in its own lane met a bad end a long time ago.  So we can expect the interstellar void to be thinly populated by an unimaginable number of rejected asteroids and comets.

Artist’s interpretation of a young solar system.

This is likely to be Rama’s backstory.  It formed somewhere that wasn’t here, and was ejected from its home star system through an impact event or gravitational slingshot.  The fact that it isn’t presently icy means that, for one reason or another, it spent enough time close to its parent star for all of its ice to boil off.  It could be a very old comet or a native of an inner solar system.  If we bothered to send a probe to catch up with it, we would be able to gain a lot of insight into the ancient chemical history of another star system.  It’s been drifting around for so long that it may be impossible to say which star it calls home, but just knowing how another solar system might be different is a big deal.  Fortunately, the movement of things in space is extremely predictable, so we’re not going to lose track of Rama anytime soon.  Unfortunately, we’d need a rocket about twice as fast as New Horizons (the fastest rocket to date) to catch up with it.

Totally swinging wide: the existence of Rama gives a little more credence to lithopanspermia, the idea that life might travel between stars embedded in rocks.  It has already been shown that some microbial life is perfectly happy in the cold, irradiated vacuum of space, and can even survive the massive accelerations involved in both ejection from, and falling onto, planetary surfaces.  What hasn’t been shown (until now) is that those interstellar rocks exist at all.  Spotting Rama in the sky is akin to living your whole life on a remote Pacific island and seeing a (probably empty) boat for the first time.  Exciting stuff!



Q: Is it more efficient to keep a swimming pool warm or let it get cold and heat it up again?

The original question: I’m having a debate with my wife that I think you can help us resolve.  We have a swimming pool in our back yard.  It has an electric heater, which we set to keep the pool water at 85 degrees Fahrenheit.  We’re going to be away for three days.  My wife says we should turn the heater off while we’re away to save energy.  I say that it takes less energy to maintain the pool at 85 while we’re away than to let it drop about ten degrees (summer evenings can get quite cool where we live in upstate New York) and then use the heater to restore 85.  Who’s right?  And what variables are relevant to the calculation?  The average temperature for the three days?  The volume of the pool?  The efficiency of the heater?


Physicist: The correct answer is always to leave the heater off for as long as possible, as often as possible.  The one and only gain from leaving a pool heater on is that it will be warm when you get in.  The same is true of all heaters (pool, car, space, whatever).

You can gain a lot of intuition for how heat flows from place to place by imagining it as a bunch of “heat beads”, randomly skittering through matter.  Each bead rolls independently from place to place, continuously changing direction, and the more beads there are in a given place, the hotter it is.

If all of these marbles started to randomly roll around, more would roll out of the circle than roll in.  Heat flow works the same way: hotter to cooler.

Although heat definitely does not take the form of discrete chunks of energy meandering about, this metaphor is remarkably good.  You can actually derive useful math from it, which is a damn sight better than most science metaphors (E.g., “space is like a rubber sheet” is not useful for actual astrophysicists).  In very much the same way that a concentrated collection of beads will spread themselves uniformly, hot things will lose heat energy to the surrounding cooler environment. If the temperature of the pool and the air above it are equal, then the amount of heat that flows out of the pool is equal to the amount that flows in.  But if the pool is hotter, then more “beads” will randomly roll out than randomly roll in.

A difference in temperature leads to a net flow of heat energy.  In fact, the relationship is as simple as it can (reasonably) get: the rate of heat transfer is proportional to the difference in temperature.  So, if the surrounding air is 60°, then an 80° pool will shed heat energy twice as fast as a 70° pool.  This is why coffee/tea/soup will be hot for a little while, but tepid for a long time; it cools faster when it’s hotter.

In a holey bucket, the higher the water level, the faster the water flows out.  Differences in temperature work the same way.  The greater the difference in temperature, the faster the heat flows out.

Ultimately, the amount of energy that a heater puts into the pool is equal to the heat lost from the pool.  Since you lose more heat energy from a hot pool than from a cool pool, the most efficient thing you can do is keep the temperature as low as possible for as long as possible.  The most energy efficient thing to do is always to turn off the heater.  The only reason to keep it on is so that you don’t have to wait for the water to warm up before you use it.
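
To see the difference in numbers, here’s a toy sketch of the three-day question using “rate of heat loss is proportional to temperature difference”.  The cooling constant is made up, and “energy” is tracked in degrees of lost heat (proportional to real energy via the pool’s heat capacity):

```python
k = 0.02                   # assumed cooling constant, per hour
T_air, T_set = 60.0, 85.0  # air temperature and pool setpoint (degrees F)
hours = 72                 # a three-day trip

# Heater on: every hour it must replace a loss proportional to (85 - 60).
energy_hold = hours * k * (T_set - T_air)

# Heater off: the pool cools toward the air, so each hour it loses
# less heat than the hour before; the heater repays the total at the end.
T, energy_off = T_set, 0.0
for _ in range(hours):
    loss = k * (T - T_air)
    T -= loss
    energy_off += loss

print(f"hold at 85 for 72 h: {energy_hold:.1f}")  # 36.0
print(f"off, then reheat:    {energy_off:.1f}")   # ~19.2 -- always less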

It seems as though a bunch of water is a good place to store heat energy, but the more time something spends being hot, the more energy it drains into everything around it.


Answer Gravy: This gravy is just to delve into why picturing heat flow in terms of the random motion of hypothetical particles is a good idea.  It’s well worth taking a stroll through statistical mechanics every now and again.

The diffusion of heat is governed, not surprisingly, by the “diffusion equation”.

\frac{d\rho}{dt}=k\frac{d^2\rho}{dx^2}

The same equation describes the random motion of particles.  If ρ(x,t) is the amount of heat at any given location, x, and time, t, then the diffusion equation tells you how that heat will change over time.  On the other hand, if ρ is either the density of “beads” or the probability of finding a bead at a particular place (if the movement of the beads is independent, then these two situations are interchangeable), then once again the diffusion equation describes how the bead density changes over time.  This is why the idea of “heat beads” is a useful intuition to use; the same math that describes the random motion of particles also describes how heat spreads through materials.
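
Here’s that correspondence as a quick simulation sketch: drop a pile of “heat beads” at one point, let each take a hundred random steps, and the pile spreads into the same smooth bump that the diffusion equation predicts.

```python
import random
from collections import Counter

random.seed(1)
# 10,000 heat beads start at x = 0; each takes 100 random +/-1 steps.
final_positions = Counter(
    sum(random.choice((-1, 1)) for _ in range(100))
    for _ in range(10_000)
)

# Crude text histogram: the initial spike has spread into a smooth bump.
for x in range(-20, 21, 4):
    print(f"{x:+3d} {'#' * (final_positions[x] // 20)}")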

In one of his terribly clever 1905 papers, Einstein described how the random motion of individual atoms gives rise to diffusion.  The idea is to look at ρ(x,t) and then figure out ρ(x,t+τ), which is what it will be one small time step, τ, later. If you put a particle down somewhere, wait τ seconds and check where it is over and over, then you can figure out the probability of the particle drifting some distance, ε.  Just to give it a name, call that probability ϕ(ε).

ϕ(ε) is a recipe for figuring out how ρ(x,t) changes over time.  The probability that the particle will end up at, say, x=5 is equal to the probability that it was at x=3 times ϕ(2) plus the probability that it was at x=1 times ϕ(4) plus the probability that it was at x=8 times ϕ(-3) and so on, for every number.  Adding up the probabilities from every possible starting position is the sort of thing integrals were made for:

\rho(x,t+\tau)=\int\rho(x+\epsilon,t)\phi(-\epsilon)d\epsilon

So far this is standard probability fare.  Einstein’s cute trick was to say “Listen, I don’t know what ϕ(ε) is, but I know it’s symmetrical and it’s some kind of probability thing, which is pretty good, amirite?”.

ρ(x,t) varies smoothly (particles don’t teleport) which means ρ(x,t) can be expanded into a Taylor series in x or t.  That looks like:

\rho(x,t+\tau)=\rho(x,t)+\tau\frac{d}{dt}\rho(x,t)+\frac{\tau^2}{2}\frac{d^2}{dt^2}\rho(x,t)+\ldots

and

\rho(x+\epsilon,t)=\rho(x,t)+\epsilon\frac{d}{dx}\rho(x,t)+\frac{\epsilon^2}{2}\frac{d^2}{dx^2}\rho(x,t)+\ldots

where “…” are the higher order terms, that are all very small as long as τ and ε are small.  Plugging the expansion of ρ(x+ε,t) into \int\rho(x+\epsilon,t)\phi(-\epsilon)d\epsilon we find that

\begin{array}{ll}&\int\left[\rho(x,t)+\epsilon\frac{d}{dx}\rho(x,t)+\frac{\epsilon^2}{2}\frac{d^2}{dx^2}\rho(x,t)+\ldots\right]\phi(-\epsilon)d\epsilon\\[2mm]=&\rho(x,t)\int\phi(-\epsilon)d\epsilon+\frac{d}{dx}\rho(x,t)\int\epsilon\phi(-\epsilon)d\epsilon+\frac{d^2}{dx^2}\rho(x,t)\int\frac{\epsilon^2}{2}\phi(-\epsilon)d\epsilon+\ldots\\[2mm]=&\rho(x,t)+\left[\int\frac{\epsilon^2}{2}\phi(-\epsilon)d\epsilon\right]\frac{d^2}{dx^2}\rho(x,t)+\ldots\end{array}

Einstein’s cute tricks both showed up in that last line.  \int\epsilon\phi(-\epsilon)d\epsilon=0 since ϕ(ε) is symmetrical (so the negative values of ε subtract the same amount that the positive values add) and \int\phi(-\epsilon)d\epsilon=1 since ϕ(ε) is a probability distribution (and the sum of probabilities over all possibilities is 1).

So, \rho(x,t+\tau)=\int\rho(x+\epsilon,t)\phi(-\epsilon)d\epsilon can be written:

\begin{array}{rcl}\rho(x,t)+\tau\frac{d}{dt}\rho(x,t)+\ldots&=&\rho(x,t)+\left[\int\frac{\epsilon^2}{2}\phi(-\epsilon)d\epsilon\right]\frac{d^2}{dx^2}\rho(x,t)+\ldots\\[2mm]\tau\frac{d}{dt}\rho(x,t)&=&\left[\int\frac{\epsilon^2}{2}\phi(-\epsilon)d\epsilon\right]\frac{d^2}{dx^2}\rho(x,t)+\ldots\\[2mm]\frac{d}{dt}\rho(x,t)&=&\left[\int\frac{\epsilon^2}{2\tau}\phi(-\epsilon)d\epsilon\right]\frac{d^2}{dx^2}\rho(x,t)+\ldots\\[2mm]\end{array}

To make the jump from discrete time steps to continuous time, we just let the time step, τ, shrink to zero (which also forces the distances involved, ε, to shrink since there’s less time to get anywhere).  As τ and ε get very small, the higher order terms dwindle away and we’re left with \frac{d}{dt}\rho(x,t)=\left[\int\frac{\epsilon^2}{2\tau}\phi(-\epsilon)d\epsilon\right]\frac{d^2}{dx^2}\rho(x,t).  We may not know what ϕ(ε) is, but it’s something, so \int\frac{\epsilon^2}{2\tau}\phi(-\epsilon)d\epsilon is something too.  Call that something “k” and you’ve got the diffusion equation, \frac{d\rho}{dt}=k\frac{d^2\rho}{dx^2}.

The second derivative, \frac{d^2\rho}{dx^2}, is a way to describe how a function is curving.  When it’s positive the function is curving up the way your hand curves when your palm is pointing up and when it’s negative the function is curving down.  By saying that the time derivative is proportional to the 2nd position derivative, you’re saying that “hills” will drop and “valleys” will rise.  This is exactly what your intuition should say about heat: if a place is hotter than the area around it, it will cool off.

The diffusion equation dictates that if the graph is concave down, the density drops and if the graph is concave up, the density increases.
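
In fact, “hills drop, valleys rise” translates directly into a numerical recipe: approximate the second derivative by (left neighbor) - 2×(here) + (right neighbor), and nudge each point by that amount.  A minimal sketch:

```python
# Explicit finite-difference version of d(rho)/dt = k d2(rho)/dx2
# on a short rod that starts with a single hot spike in the middle.
k, dt, dx = 1.0, 0.1, 1.0   # chosen so k*dt/dx**2 <= 0.5, keeping the scheme stable
rho = [0.0] * 21
rho[10] = 100.0             # the initial "hill" of heat

for _ in range(50):
    curvature = [rho[i-1] - 2*rho[i] + rho[i+1] for i in range(1, 20)]
    for i in range(1, 20):
        rho[i] += (k * dt / dx**2) * curvature[i-1]

print([round(r, 1) for r in rho])   # the spike has flattened and spread out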

This is a very long-winded way of saying “think of heat as randomly moving particles, because the math is the same”.  But again, heat isn’t actually particles, it’s just that picturing it as such leads to useful insights.  While the equation and the intuition are straightforward, actually solving the diffusion equation in almost any real world scenario is a huge pain.

The corners cool off faster because there are more opportunities for “heat beads” to fall out of the material there.  Although this is exactly what the diffusion equation predicts, actually doing the math by hand is difficult.

It’s all well and good to talk about how heat beads randomly walk around inside of a material, but if that material isn’t uniform or has an edge, then suddenly the math gets remarkably nasty.  Fortunately, if all you’re worried about is whether or not you should leave your heater on, then you’re probably not sweating the calculus.



Q: What determines the size of the bright spot when you focus sunlight with a lens?

Physicist: This question really appeals to my ten-year-old self.  If you’ve never tried to burn something with a lens, collect three pairs of sunglasses, a magnifying lens, and something you dislike.  On a bright day, put on all three pairs of sunglasses and give it a shot.

Burning stuff with a magnifying lens: education at its finest.

Typically, when you try to focus sunlight with a lens you’ll get something that looks like a comet.  You turn the lens and move it up and down and just at the moment when the bright spot gets the smallest, you suddenly see clouds.  This is because the smallest you can concentrate light from the Sun using a lens is an in-focus image of the Sun, and incidentally, an in-focus image of anything else in the sky as well.

Paper held in the “focal plane” of a pair of binoculars during an eclipse.  This is the smallest that the light can be focused.  If the paper were any closer or farther away the images of the Sun would become blurry and more spread out.

There are several types of lens, depending on how the two sides are curved, but in each case the defining characteristic is the “focal length”, f.  The big rule for lenses is: parallel light collects together at the same place, the “focal point”, which is f away from the lens.

Left: Parallel light beams collect together at a focal point, with a different point for each different set of parallel beams.  The collection of these points is the “focal plane”.  Right: Here we’ll consider the simplest case, light perpendicular to the lens, since the results apply in general and we can talk about the focal point (“the focus”) instead of the focal plane.

An “image” is the location where the light from an “object” is brought together by a lens.  The image is so named because if there happens to be a screen at that location, an image of the object will appear in-focus.  The distance to the image depends on the distance to the object.

For lenses, the rules are: #1) Parallel light beams will pass through the focus (dots) on the far side of the lens.  #2) Light passing through the center of the lens doesn’t change direction.  These rules allow us to figure out the size and location of the Image given the size and location of the Object (black bars).

It takes a little algebra (included in the “answer gravy” below), but rule #1 in the caption above leads to the Thin Lens Equation:

\frac{1}{d_i}+\frac{1}{d_o}=\frac{1}{f}

where do is the distance from the lens to the object, di is the distance from the lens to the image, and f is the focal length of the lens.

Rule #2 is easier to work with and, with a little less algebra, leads to the magnification equation:

M=\frac{h_i}{h_o}=\frac{d_i}{d_o}

where hi and ho are the sizes of the image and object.  M is the factor for how much bigger the image is than the object.  For a microscope, you want M to be big.
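
As a quick sketch, both equations fit in a couple of lines (any consistent units work):

```python
def image_distance(f, d_o):
    """Thin lens equation solved for d_i: 1/d_i = 1/f - 1/d_o."""
    return 1 / (1 / f - 1 / d_o)

def magnification(d_i, d_o):
    """M = h_i/h_o = d_i/d_o."""
    return d_i / d_o

d_i = image_distance(0.2, 10.0)   # a 0.2 m focal length lens, object 10 m away
print(d_i)                        # ~0.204 m: just past the focal point
print(magnification(d_i, 10.0))   # ~0.02: the image is ~2% of the object's size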

The distance to the Sun, do, is infinite in every useful sense.  That’s why it took so long to figure out how far away it is.  As far as (reasonably sized) lenses go, a few miles may as well be the other side of the universe.  The distance to the Sun is somewhere in that “may as well be on the other side of the universe” range.  In fact, everything in the sky falls into the same category, which is why the night sky looks like a dome, rather than the unending void that it is.

Plug do=\infty into the thin lens equation and you find that \frac{1}{f}=\frac{1}{d_i}+\frac{1}{\infty}=\frac{1}{d_i}, and so di=f.  In other words, the Sun, and everything else “at infinity”, will be in focus f away from the lens.  This coincides with the definition of the focal length, since light from a source at infinity is always parallel.  That should jibe with your experience: if you look at a light ten feet away and you step back and forth, the angle to the light changes, but if you look at the Sun (don’t) and step back and forth, the angle to the Sun stays the same.

Now on the face of it, it would seem as though there’s no way to figure out the size of the Sun’s image, hi, since \frac{h_i}{h_o}=\frac{d_i}{d_o}=\frac{f}{\infty}=0.  As with the resolution to so many other unwanted infinities, all we need is a little algebra.

Without some fairly sophisticated techniques, it’s impossible to gauge how far away the Sun is.  But while do is out of reach (for most of us), \frac{h_o}{d_o} isn’t.  By measuring the Sun’s apparent size in the sky, it’s easy to figure out that it’s 110 times farther away than it is big.  The same thing, very coincidentally, is true of the Moon; it is 110 Moon-diameters away from Earth.  Mathematically speaking, \frac{h_o}{d_o}=\frac{1}{110}.

Retentive readers will recall that we haven’t brought the magnification equation, \frac{h_i}{h_o}=\frac{d_i}{d_o}, into play.  That was intentional; pretending there’s an issue heightens drama.  Solving for the image size, hi, and plugging in what we already know, \frac{h_o}{d_o}=\frac{1}{110} and di=f, we get:

h_i=\frac{h_o}{d_o}d_i=\frac{f}{110}

So, how big is the bright spot when you focus sunlight?  At best, a little less than 1% of the distance to the lens.  To concentrate light from the Sun as much as possible, you want to position the target (the solar oven, the ant trail, the piece of wood, whatever) at the lens’ focal distance.  When you do, the bright spot will have a diameter of \frac{f}{110}.  This ultimately comes down to the fact that the Sun is really far away, and 110 times smaller than it is distant.
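
Put as a one-line rule of thumb (a sketch, with the focal length in millimeters):

```python
def sun_spot_mm(focal_length_mm):
    """Best-case diameter of the focused Sun: f/110, because the Sun
    is about 110 of its own diameters away."""
    return focal_length_mm / 110

for f in (100, 200, 500):   # a few typical magnifier focal lengths
    print(f"f = {f} mm -> spot diameter ~ {sun_spot_mm(f):.1f} mm")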

The bigger a lens is, the more Sunlight it can gather.  So the best lenses for burning stuff are as big as possible (more light), with the shortest possible focal length (tiny spot).

Answer Gravy: Every now and again, it’s worth seeing how statements of fact turn into math.  Geometric optics (which sounds much more impressive than it is) basically boils down to the two rules mentioned above:

#1) Parallel light beams will pass through the focus on the far side of the lens.

#2) Light passing through the center of the lens doesn’t change direction.

The thin lens equation almost immediately falls out of these rules and the geometry of similar triangles.  Rule #2 is the easiest to work with.  Looking at the line that passes through the center of the lens, we find two similar triangles on either side: the one on the left has legs ho and do, and the one on the right has legs hi and di.  Since these triangles are similar, the ratio of these lengths is the same: \frac{h_o}{d_o}=\frac{h_i}{d_i}.  Rearranging this to put the h’s and d’s on opposite sides produces the magnification equation, \frac{d_i}{d_o}=\frac{h_i}{h_o}.  Easy!

Using the same basic trick on the triangles formed by rule #1, we can find the thin lens equation.  Looking at just the right side (which side doesn’t matter), there are two triangles similar to each other.  A smaller one with legs f and ho and a larger one with legs di and ho+hi.

\begin{array}{rcl}\frac{h_o}{f}&=&\frac{h_o+h_i}{d_i}\\[2mm]\frac{h_o}{f}&=&\frac{h_o}{d_i}+\frac{h_i}{d_i}\\[2mm]\frac{h_o}{f}&=&\frac{h_o}{d_i}+\frac{h_o}{d_o}\\[2mm]\frac{1}{f}&=&\frac{1}{d_i}+\frac{1}{d_o}\end{array}

And there it is.  We start with a pair of intuitive, but difficult to apply principles, and end up with a pair of unintuitive, but easily applicable equations.
