So, you may have heard about these things called countable elementary submodels. You may have heard that they work like magic and do all sorts of amazing things. “Mathematical voodoo” some might say. “Witchcraft!” others declare. Hearing this you become intrigued and set out to harness this black power. You quickly realize that there are very few places to learn this dark art; the protectors of this knowledge don’t want it leaking out.
Here I hope to lay out the essential things you need to know (and omit the things you don’t need to know) so that you can start using countable elementary submodels. I am going to lay out as little of the machinery as possible and display only the relevant applicable facts you will need for most proofs involving elementary submodels.
1. A Countable Submodel of What?
The universe of all sets is a ‘model’ for set theory, but it is too big. If we did have a model for set theory we would know that there is a countable submodel of it, by Lowenheim-Skolem. Of course we can’t assert that set theory has a model as this would be equivalent to asserting the consistency of set theory. The clever way around this is to realize that any proof in mathematics only ever uses finitely many axioms of set theory and references only finitely many specific sets. It is always possible to find a model
of those finitely many axioms and special sets. (Aside, for those of you who have seen this before, why doesn’t this violate the compactness theorem? It’s tricky.) Here
will be our copy of the universe, just for a given proof, and we will take a countable submodel of
, not
. This is where the language “Take a large enough fragment of ZFC” comes from.
As it turns out there is a class of sets that we usually draw from. We usually take
to be a set
, where
is a cardinal and
is the set of all sets hereditarily of cardinality less than
. This doesn’t really matter at all. So don’t fret about this.
2. What does the word ‘elementary’ in CESM mean?
A submodel of
is an elementary submodel (denoted
) if any statement true in
is true in
(given that we have relatavized the statement to
). Basically, if
and
have the same language and we can express a statement in that language it should be true in
iff it is true in
. Relativizing a statement to
means that we only quantify over things in
, not all of
. Also, we are not allowed to reference things outside of
.
For example, as dense linear orders . We know that in
the following statement
is true:
Well what convention did we just use? We just assumed that the only quantifies over rational numbers (and not say over complex numbers, or sets). Relatavizing this statement to
we see that it is also true, and now the statement
is
Notationally we say and
. (See Chris Eagle’s comment below)
Basically relativizing makes sure that a model can actually say something about the statement.
3. What does it mean for a model to “think something is true”?
Saying that thinks that a statement
is true is simply shorthand for
. So what? First of all we get paradoxical conclusions like
is uncountable. How can this be because
is countable?
Well, a model thinks that is uncountable because all of the injective functions
that are in
are not onto. So even though
actually has only countably many elements of
, the model also doesn’t have any way to check that. Even though from the outside we see that there is a bijection
, this function is not in
.
The thing to be careful about here is that ‘s copy of
is not the real copy of
. (How could it be?!
‘s copy has only countably many elements!)
4. How do I determine when something is in ?
SUPER USEFUL FACT: Any set definable from parameters in
is actually a member of
.
Remember that sets in sometimes have the form
where
are free variables in the statement
. If the objects in
are all in
, then
is definable from parameters in
.
For example, back in ,
is definable in
, but not in
.
This is kind of like super useful fact from forcing that asserts that I am allowed to define sets in the ground model even if I use the forcing symbol in definition, so long as all the other parameters are in the ground model.
5. What is the big difference between and
?
During a proof using CESM you will be using the super useful fact over and over. Why does it matter? Think about how sets work with respect to set operations (Power set, intersection, union, image of a function, etc.). In that case you want to know that applying these operations to a set will produce a set. In symbols, asserts that
is a set, but
does not assert that. For example the class of all ordinals ON
and is not a set, so we can’t do things like take
.
If then the model “knows about A” and we can use the super useful fact.
6. What is the really important thing I need to know about finite and countable subsets of ?
For sets of small cardinality sometimes we automatically get information about them.
FACT: Any finite subset of
is actually a member of
.
FACT: Any countable member of
is actually a subset of
.
7. What are the really important things I need to know about ordinals?
Many theorems using CESM start of by considering .
FACT 1:
is a countable ordinal. That is,
.
FACT 2: If
,
and
then
is a stationary set in
. In particular,
is uncountable.
Moreover we get the folowing (quite general) fact:
Assaf’s FACT: If
and there exists some
which is not in
, then
is uncountable.
Proof. Recall that any countable
is also a subset of
.
What else?
This should give you the basics that you need to know in order to read a proof using CESM. For more information about CESM see chapter 24 in Just & Weese’s book “Discovering Modern Set Theory; Volume 2”. I hope that this was helpful and please let me know if there is anything that needs changing.
Next week I will tackle two proofs using CESM.
Very nice, Mike!
is CESM,
and
, then
is stationary. Since
is not in
, I think it is worth also mentioning the following useful fact:
is CESM,
and there exists some
which is not in
, then
is uncountable.
is also a subset of
.
You pointed out that if
Fact. If
Proof. Recall that any countable
LikeLike
This is pretty sweet. I hope all the graduate students (and in my case, former graduate students) in set theory out there find it useful!
LikeLike
Excellent piece!
LikeLike
Hey Mike,
It’s very handy to have the useful facts all in one place! The thing I mentioned to you the other day is that in section 2, it isn’t technically true that
. For one thing,
includes symbols for which
doesn’t have an interpretation, namely
(assuming that you intended the underlying language of orderings to only have the symbol
). What is true is that
, since any quantifier (in any formula) is always interpreted as ranging over the elements of whatever structure you're in when you interpret it.
What is almost true is that
is true (in
) if and only if
, where
here denotes the "real" membership relation. I say "almost" because formally the definition of structures requires them to have universes which are sets, and hence
only makes sense if
is a set, whereas
makes sense even if
is a proper class.
I hope that made sense. Thanks again for the useful post!
LikeLike