A Fascinating New Higgs Boson Search By The DZERO
Experiment
By
Tommaso Dorigo

Reporting on scientific results to a broad audience is difficult, in my
opinion, not so much because of the need to explain things in a simple way
-which is easy and fun, once you master the matter- as for the
self-discipline you are forced to stick to.
Things that are obvious to you, because you have seen them and studied
them for years, are not obvious to others, not even to fellow scientist
from the other door who practice a just slightly different sub-field of
research, let alone to the larger pool of smart readers who are drawn to
science readings but have never taken an advanced course on the matter. An
alarm bell has to go off every time you hit on a concept which is likely
unfamiliar to most: maintaining in good operation that alarm bell is by
far the hardest part. At least, that is what my experience tells me.

So let me try and check how rusty that alarm bell is today, as I make an
attempt at reporting in hopefully simple terms on a new particle physics
search by the DZERO experiment (see right), one for a subatomic process
that I find extremely interesting, and which, once found, will make
manifest the hard core of our theory of electroweak symmetry breaking, a
fundamental pillar of our current understanding of the subnuclear world.
This process is the associated production in hadron collisions of three of
the fanciest building blocks that Nature (the bitch, not the magazine) has
provided our playground with: a top-antitop quark pair, and a Higgs boson.
This, by the way, is a good point for you to jump ahead by a couple of
sections in this long post, if you are curious about the new DZERO result
but know about, or are not interested in, some underlying basic facts
about hadron collisions.
Some background on hadronic collisions
In the case of the result I report today the hadron collider at hand is
not the LHC, the fast-asleep giant sitting underground below the border
between Switzerland and France, but rather the quite awake Tevatron
collider of Batavia, Illinois, which is traversed daily by a few
micrograms worth of protons and antiprotons, all rigorously traveling at
99.9 and something percent of the speed of light.
As protons and antiprotons hit each other at that fantastic speed, they
often just bounce off each other retaining their integrity: physicists
call that process "elastic scattering", but they are not interested in it,
because it tells them about as much on the inner structure of matter as a
glance at a glass of wine can tell you about its tannins.
In closer encounters the protons may break apart, but still nothing much
happens. It is only in those rarer instances when one of the most
energetic quarks or gluons making up a proton hit directly one of their
kin from the antiproton that things become interesting. The collision is
then hard enough that the two constituents -we call them "partons"- kick
each other off their envelopes, the parent proton and antiproton: they
then materialize in the form of streams of subatomic particles, energetic
sprays that we call hadronic jets; in still rarer instances,
their interaction instead gives rise directly to new states of matter.
(I have explained elsewhere why
quarks can only be seen as jets, and I will avoid repeating myself
here. Suffices to say that quarks cannot exist isolated, and they must
"dress up" in the form of hadrons. Hadrons are composed of pairs or
triplets of quarks: it is them, and not individually their constituent
quarks, the particles that make up the jets we observe in energetic
collisions.)
Doing business with a hadron collider -a proton-antiproton collider like
the Tevatron, for instance- is a frustrating experience: you spend your
money and wits to build a machine that accelerates those particles at
incredibly high energies, and then as you turn it on would like to see
that energy materialize into fantastically energetic jets, or even better,
new exotic, massive particles that can only be produced by exchanging the
projectiles' kinetic energy into mass. Instead, you have to accept the
fact that most of the collisions you get release way less energy than the
total theoretically available.
That is because what is colliding are not really the proton and antiproton
that you launched one against the other -or more precisely, one each among
a trillion of the former and a hundred billion of the latter. What really
collides is a (anti)quark or a gluon inside one projectile and a (anti)quark
or a gluon inside the other. And, since these constituents of the proton
only carry a small fraction of the total kinetic energy of their parent
"envelope", the total energy release is smaller than the sum of proton and
antiproton kinetic energy.

Particle physicists learn this fact in their playground years. Parton
Distribution Functions (PDF) have been devised to describe what is
the probability that a quark or a gluon is found with a given fraction of
their envelope's total energy. The graph on the left shows their density
as a function of the fraction x the partons carry, for different parton
species: g labels gluons, u are up-type quarks, d are down-type quarks,
etcetera. Needless to say, these functions get vanishingly small as the
fraction approaches unity: at the Tevatron for instance you will never,
ever, get a collision releasing 1.96 TeV, which is the sum of the proton
and antiproton energies provided by the superconducting, 4-mile-long
accelerator. The same, of course, goes for the LHC: 14 TeV will never be
actually reached by that machine; nope: not even if its currently ongoing
repairs exceed expectations!
How energetic can a collision be at the Tevatron, then ? That depends: the
larger the number of collisions you observe, the higher is the chance to
see a very energetic one. The most energetic collisions recorded by CDF
and DZERO, the two experiments built around the points where the Tevatron
proton and antiproton beams intersect, have a total energy release of
about 1 TeV, but they are exceedingly rare. Below is a 2-D bar chart of
the energy read out by the DZERO detector for two high-energy jets: the
detector is like a cylinder surrounding the collision point, and it has
been cut along one side and unrolled on the plane you see, to display the
localized energy deposits of the streams of hadrons which hit it. Such
events are spectacular: they are as close as you can get to actually
"seeing" two quarks.

Enter the production process
Now, the above introduction served one important purpose besides sorting
out the few of you who really want to get personal with elementary
particles: it provides important input to figure out why the production of
a top-antitop quark pair AND a Higgs boson is so rare and special. It is
only a part of the whole story, but let me use the acquired knowledge at
once. Top quarks are the heaviest ones in the lot of six we have figured
out matter can be made of. They weigh about 173 GeV each, which is the
total weight of about 184 hydrogen atoms! As for the Higgs boson, we may
assume it weighs 120 GeV here for the sake of argument: existing
experimental hints point to
a value not too far from the one above.
Let us make a simple addition: two top quarks, plus a Higgs boson, already
make a rest mass of 470 GeV. This is about a quarter of the total kinetic
energy of a proton-antiproton pair at the Tevatron, and it is an energy
which is only reached once in a million collisions or so. Those PDF are
indeed functions steeply peaking when the fractional energy is close to
zero, as I noted above.

There is at least another important thing to consider. Not all collisions
above 470 GeV produce a top, an antitop, and a Higgs boson! Quite on the
contrary, that piece of magic is a rare occurrence regardless of the
energy release. The rules for computing the probability of subatomic
production processes like the one we are discussing are enshrined in
Feynman diagrams, graphs which describe the space-time propagation of the
colliding and produced particles. On the left you can see one such
diagram: time is taken to flow from left to right here, and only one space
dimension is drawn, on the vertical axis.
As you see, the way a
reaction occurs
works by first producing a top-antitop pair, and then letting the duo (in
the case shown) "radiate off" a Higgs particle. Such a feat is predicted
to occur because the Higgs boson couples to any particle if the
latter possesses a mass, and that is for sure something that top quarks
are good at -they are the heaviest elementary bodies known to us. Just as
a photon can be emitted by any electrically charged body, and it does so
more readily if the charge of the latter is higher, a Higgs boson will
happily be emitted by any mass-endowed particle. The difference, however,
is that Higgs bosons are very heavy themselves, and you cannot produce
mass out of nothing: the top quark originating the Higgs boson has better
be very energetic to enable Higgs radiation.
Computing precisely the rate of tth production is beyond my decidedly
experimental expertise, but from the few hints I provided above you can
probably accept that such rate is seriously dampened by asking for a very
specific way of spending the 500 or more GeV of energy we have already
dearly paid for (a one-in-a-million chance, give or take a cow or two).
The easiest way to release that energy for a proton-antiproton collision
would be to produce two 250-GeV jets like the ones pictured above: any
quark pair would do, and even a pair of gluons would be a very probable
solution. Asking for top quarks makes this much less probable because most
of the energy has to go into the mass of the two fat guys; and asking that
one top quark spits out a Higgs boson makes this a real rarity.
(If you need more detail, here is an important piece: the dampening in
part comes from the narrowness of a thing called phase space.
Given a fixed 500 GeV energy budget to materialize a pair of quarks,
Nature way prefers light-mass ones, since almost all the initial energy
may then be allocated to endowing quarks with large momenta, and large
momenta translate in a larger span of allowed configurations. The more
configurations, the more probable a process is! But in our case it is
still worse: we need one of the top quarks to be produced with a mass much
larger than its nominal 173 GeV, since we need the surplus to materialize
the Higgs boson via radiation. Short-lived particles can indeed be created
off-mass-shell, but the probability for this to happen rapidly
decreases with the departure from their rest mass.)
From the above discussion you have certainly gathered that there are
several factors making tth production a very rare process. Well, that is
exactly true. And here is the final result: all in all, at the Tevatron
one expects less than one in ten trillion collisions to give rise to a ttH
final state. Once-in-ten-trillions is roughly as frequent as a total
failure of the "save a parent" strategy some dads and moms adopt when they
have to travel, taking different flights to reach the same destination:
once in ten trillion times, both planes crash (I am purposedly neglecting
correlated catastrophes such as 9/11 here, but you get the point). That is
what I call pretty darn rare, what do you think ?
...And they still search for those!
Despite the rarity of associated production of a top-antitop pair and a
Higgs boson -or maybe because of that!-, the CDF and DZERO experiments at
the Tevatron have started to look for it. That does not mean the
physicists in these collaborations are desperate: a search for a new
physical process predicted by your theory is intrinsically interesting and
worth pursuing even if you predict your experimental setting cannot reach
the required sensitivity to observe the process, because of a couple of
facts.
One: particle theory -the so-called Standard Model- has reached a
high level of refinement, but putting it to the test in new ways, such as
searching for a process that is predicted to be unobservable in a
particular environment, might eventually spot an otherwise unnoticed
breach in the theory, opening the way to the unknown. Two: by studying
peculiar and rare final states of particle collisions one might stumble in
some unexpected, unpredicted new process, again resulting in a disclosure
of new physics.
Also, please consider: measuring something which is predicted to be very
small or even better, exactly zero (or undiscernible from zero) is a very
effective way to probe your theory, much better than checking whether
physical quantity A is equal to seven hundred gudungoons or seven hundred
and twelve gudungoons. The reason for this is that zero is a quite
peculiar number: any departure from zero sticks out as a lamppost,
while a departure from a non-zero quantity can at most be as significant
as the prediction is precise. Or if you prefer, a rare unknown effect may
only be seen by checking if zero is exactly zero, rather than if some
non-zero quantity is in the high-gudungoons range or not.
In any case, there one last input I have so far denied you: the number of
collisions that have been produced since 2002 in the core of CDF and DZERO
amounts to about 400 trillions each! If the Higgs boson exists, a handful
of those events have most certainly been popping up: maybe once a year,
but they must have! Now, go tell the needle hiding in the haystack that
DZERO searches for a few events in 400 trillions, and you'll see it
rolling away in laughter. But physicists are clever! So let me tell you
how clever they have been now.
The DZERO search
DZERO searched for the
process in 2.1
inverse femtobarns of collisions, which is little less than a
third of what they have accumulated this far. This delay is due to the
gigantic amount of computing required to reconstruct and analyze the
information, as much as to the complexity of the analyses, which must at
some point "freeze" the datasets they use before performing further steps
which may take months to complete. Inverse femtobarns sound like
complicated things the first time you hear about them, but they actually
simply count the number of projectiles that have crossed a unit area in
the center of the detector. Multiply that by the "effective area" of their
targets, and you get the number of hits.
The first step of the analysis consisted in figuring out the best way to
get rid of most the large number of collision events stored by the data
aquisition system, those which were the least likely to have originated by
the production of the ephemeral trio of heavy particles and their
subsequent disintegration into stable particles -those actually producing
the observable detector signals which constitute the "event" data.
Here I have to unveil a detail I so far hid: the data stored by DZERO
amount to much fewer than hundreds of trillion events, because the rate at
which collisions occur in the core of the detector during data taking is
about three megahertz -three millions per second-, which is way too high
to allow detector readout and storage. A online trigger system in fact
takes care to neglect those collisions which resulted in elastic
scattering or low-energy release, and only read out and store the most
interesting, energetic events. The trigger system is one of the most
complicated parts of the detector hardware, but I will not discuss it
further here: suffices to say that the offline analysis starts with a
dataset of only billions of events, ones containing an already
pre-selected signal of high-energy electrons or muons.
Electrons and muons. These are the diamonds collider experiments mine for.
Their presence immediately signals that the original hadrons gave rise to
an electroweak interaction, as opposed to the much more frequent strong
interactions that quarks (and gluons) are most likely entertain themselves
with. A high-energy electron or muon can only be produced by the decay of
a W or Z boson or by a photon, the carriers of the electroweak
interaction. In our case, these particles signal the decay of top quarks,
and DZERO uses their presence to select a sample of top quark pair decays.
The decay chain of a top quark pair yielding one electron and jets can be
written as follows:
. Here you see
that one of the W bosons produced an electron-neutrino pair, while the
other yielded one further quark-antiquark pair. Each of the quarks (both
the bottom quarks duo and the generic ones labeled by the letter q)
produces a energetic jet of hadrons, and the top-antitop signature one
ends up observing in the detector is called "single lepton", to
distinguish it from "all hadronic" and "dilepton" ones containing
respectively only jets or two leptons and two jets. As for the neutrino
accompanying each lepton in the W boson decay, they leave the detector
unseen, but their presence is inferred by the imbalance in the energy
flowing out of the collision point.
My DZERO colleagues know extremely well how to best select top-antitop
events in the single lepton final state: they have been doing this since
CDF showed the way, in the days prior to the top quark discovery in 1995.
But in the case at hand, a Higgs boson must be present in the event: this
particle, if it is not too heavy, decays most of the time into an
additional pair of b-quark jets. The total signature DZERO is after is
thus one of lepton plus many jets, nominally six of them. Since, however,
jets may be lost in uninstrumented regions of the solid angle around the
collision point, or overlap to other jets, or fail to have enough energy
to provide a clear identification, DZERO chooses to concentrate on two
different signatures: lepton plus four jets, and lepton plus
five or more jets.
After selecting the above topologies, the data is still polluted with
processes that have little to do with top quark pair production (not to
mention the exceedingly rare tth events). A further cleanup is provided by
requiring that at least one of the jets contains an indication of having
been originated by a bottom quark. Bottom quarks are not as heavy as top
quarks, and thus are not as rare; but they are still
rare enough that their presence is a strong indication of a top decay.
In the end, DZERO applies a technique we might call "Divide et Impera"
(divide and rule): by slicing and dicing the selected event set into
different classes (each consisting in a well-defined event signature) a
small signal is easier to find, even if some classes will contain a much
smaller fraction of it than others. Below is a table from the
preliminary paper produced by DZERO: as you can see, electron plus
jets and muon plus jets are separated, and then events with only 4, or 5+
jets, are kept distinct. Further, the number of b-quark jets identified in
the event (dubbed "b-tags") allows to define a total of 12 exclusive
subsets. To decode them, take "5j2t" as an example: that class contains
events with at least 5 jets (5j), and exactly two b-tags found in them
(2t).

In the table, top-antitop events are estimated separately from "non-tt
backgrounds", to highlight the power of the DZERO selection: especially in
the 2- and 3-b-tags classes, the top contribution far exceeds the other
annoying processes that pollute the selected dataset. Also, the observed
event counts (shown in the last line for both electron+jet and muon+jet
topologies) match pleasingly the expected sum of backgrounds, indicating
that there are no surprises and possibly no mistakes. Finally, please
notice how the number of events has been reduced by orders of magnitude by
the combination of requirements on the presence of a lepton and many jets.
Still, the expected signal across the table sums up to less than one
event. One event in 626 ? That is not a needle in the haystack anymore!
Kudos to DZERO for their clever selection then!
The final step of the analysis is different from what has become common
practice in the searches of small signals buried in large backgrounds
nowadays. Rather than relying on the modeling of many observed
characteristics of the quite complicated kinematics these multi-object
final states possess, DZERO uses a single, dumb but foulproof variable:
the HT, which is computed as the sum of transverse energies of all the
jets, the lepton, and the inferred neutrino. This quantity provides a
quite model-independent tool to discriminate tth from tt events, and its
use sidesteps a theoretically nagging problem: our insufficient
understanding of the main background to the search, the production of top-antitop
quark pairs accompanied by bottom-antibottom pairs.
In the figure below, a HT distribution is shown for the sum of the two
classes most "signal-rich": the ones corresponding to five or more jets
and three or more b-tags. The black points show the observed data events
(a total of five entries), and the red distribution shows the expectation
from known processes (basically dominated by top-antitop production). The
black histogram instead shows the expected HT distribution that the signal
would display, if it had a production rate exceeding by 100 times the
predictions from theory. The different shape of red and black histogram is
the whole point of using HT as a discriminating variable.

The 12 HT distributions are finally tested by comparing data to the
expectation from the sum of backgrounds (which includes both top
production and non-top processes). A complex, very accurate method is used
to combine the variegated information coming from the twelve distributions
into a single response -the ratio between the probability that the data
contain both signal and background together, divided by the probability
that they contain only backgrounds. The study of that quantity allows to
set a upper limit on the number of signal events contained in the data,
which can then be converted in a limit on the signal cross-section by
suitably multiplying for the signal detection and reconstruction
efficiency, numbers that can be estimated by Monte Carlo simulation
programs.
The whole procedure is repeated for different hypotheses of the unknown
value of the Higgs boson mass, and the limit thus becomes a function of
it. The end result is displayed, as has become customary, as a curve
showing, as a function of the Higgs mass, the ratio between the resulting
upper limit on signal cross section and the cross section that is
predicted by the Standard Model theory. That is to say, if for a given
hypothetical Higgs mass a limit is set at a value equal 100, that means
that DZERO's data exclude the presence of tth production at a rate
exceeding 100 times the predicted production rate, in case the Higgs has
that mass.

As you can check in the graph on the right, the limit is not very
stringent! In fact, an exclusion of a mass range for the Higgs boson would
result if the limit curve assumed values below 1.0 in that range (the
hatched black line). Instead, the red curve (labeled Observed limit)
is floating at values well above 30 times the Standard Model prediction.
This should not be read to imply that the DZERO search has not been
fruitful or successful! The result is, in fact, quite interesting, for
several reasons.
First of all, an anomalous coupling of the Higgs boson to the top quark
might boost the production rate above Standard Model predictions, and the
DZERO limit sets a bound on that occurrence. Second, the final states
investigated by the search had never before studied in detail at the
Tevatron, and it feels good to see we do understand the production of top
quarks in association with b-quarks. Third, the search provides guidance
for future investigations in the same direction. Fourth, the limit,
although very loose, can be combined with advanced statistical procedures
with the much more stringent ones coming from higher-rate production
processes of the Higgs boson, eventually contributing to advancing our
knowledge of that elusive particle. Elusive is certainly an appropriate
word: Higgs bosons have been hypothesized more than forty years ago, and
we still have to see one of them...
The only remaining taks, at the end of this long article, is for me to ask
you, dear readers, whether the alarm bell I mentioned at the beginning
needs to be serviced!
Further reading:
A call for ideas on how to improve a prospective search for tth events
with the CMS detector
Home page of the DZERO experiment
Home page of the
Fermi National Accelerator Laboratory
Real-time display of Tevatron performances