Introduction
Here we undertake the subject of Thermodynamics, including the 1st and 2nd Laws,
and how they impose certain constraints on material behavior
and the models that describe it. The subject quickly becomes rather abstract.
Nevertheless, it is a fundamental part of continuum mechanics.
1st Law
The 1st Law of Thermodynamics imposes the conservation of energy. It does so
in rate form by stating that the net sum of the rates at which energy is
transferred among different forms is zero. We will first list
all the different, relevant forms of energy, and then differentiate
them with respect to time to get the rate forms.
The relevant forms are
\[
\begin{eqnarray}
&& \text{Internal Energy} & \qquad \quad \int \rho \, u \, dV \\
\\
&& \text{Kinetic Energy} & \qquad \quad \int {1 \over 2} \rho \, {\bf v} \cdot {\bf v} \, dV \\
\\
&& \text{Internal Forces} & \qquad \quad \int {\bf f} \cdot {\bf u} \, dV \\
\\
&& \text{Surface Tractions} & \qquad \quad \int {\bf T} \cdot {\bf u} \, dS \\
\\
&& \text{Heat Generation} & \qquad \quad \int \left( \int \dot Q \, dV \right) dt \\
\\
&& \text{Heat Flux} & \qquad \quad \int \left( \int {\bf q} \cdot {\bf n} \, dS \right) dt
\end{eqnarray}
\]
where:
\(\rho\) is density
\(u\) is internal energy, a scalar
\({\bf u} \; \; \) is the displacement vector
\({\bf v} \; \; \) is the velocity vector
\({\bf f} \; \; \) is the body force vector
\({\bf T} \; \; \) is the Traction vector
\({\bf q} \; \; \) is the heat flux vector
\(\dot Q\) is the heat generation rate per unit volume
\({\bf n} \; \; \) is the unit normal vector to the control volume surface
\(dV\) is the differential volume element of the control volume
\(dS\) is the differential surface element of the control volume
\(dt\) is the differential time increment
Now take the time derivative of each term to obtain a rate of change.
\[
\begin{eqnarray}
&& \text{Internal Energy} & \qquad \quad \int \rho \, \dot u \, dV \\
\\
&& \text{Kinetic Energy} & \qquad \quad \int \rho \, {\bf a} \cdot {\bf v} \, dV \\
\\
&& \text{Internal Forces} & \qquad \quad \int {\bf f} \cdot {\bf v} \, dV \\
\\
&& \text{Surface Tractions} & \qquad \quad \int {\bf T} \cdot {\bf v} \, dS \\
\\
&& \text{Heat Generation} & \qquad \quad \int \dot Q \, dV \\
\\
&& \text{Heat Flux} & \qquad \quad \int {\bf q} \cdot {\bf n} \, dS
\end{eqnarray}
\]
The displacement vectors, \({\bf u}\), turn into velocity vectors,
\({\bf v}\). And the acceleration vector, \({\bf a}\), appears in
the kinetic energy term.
Now equate all these rates as follows: The rate of change of the internal energy
and kinetic energy in a control volume equates to the sum of all the other rates.
\[
\underbrace { \int \rho \, \dot u \, dV }_{\matrix{Internal \\ Energy}} +
\underbrace { \int \rho \, {\bf a} \cdot {\bf v} \, dV }_{\matrix{Kinetc \\ Energy}} =
\underbrace { \int {\bf f} \cdot {\bf v} \, dV }_{\matrix{Body \\ Forces}} +
\underbrace { \int {\bf T} \cdot {\bf v} \, dS }_{\matrix{Surface \\ Forces}} +
\underbrace { \int \dot Q \, dV }_{\matrix{Heat \\ Generation}} -
\underbrace { \int {\bf q} \cdot {\bf n} \, dS }_{\matrix{Heat \\ Flux}}
\]
Note that the heat flux term is negative because the energy is flowing out of the
control volume.
Internal Energy
Internal energy is the sum of thermal energy and elastic strain
energy. This is the thermal energy that is discussed in thermodynamics
classes, usually \(c_v T\), and the strain energy that is
discussed in mechanics classes, usually
\( {1 \over 2} \boldsymbol{\sigma} : \boldsymbol{\epsilon} \),
at least for linear materials.
The next step is to replace the traction vector, \({\bf T}\),
with \(\boldsymbol{\sigma} \cdot {\bf n}\). This gives
\[
\underbrace { \int \rho \, \dot u \, dV }_{\matrix{Internal \\ Energy}} +
\underbrace { \int \rho \, {\bf a} \cdot {\bf v} \, dV }_{\matrix{Kinetc \\ Energy}} =
\underbrace { \int {\bf f} \cdot {\bf v} \, dV }_{\matrix{Body \\ Forces}} +
\underbrace { \int {\bf v} \cdot {\boldsymbol{\sigma}} \cdot {\bf n} \, dS }_{\matrix{Surface \\ Forces}} +
\underbrace { \int \dot Q \, dV }_{\matrix{Heat \\ Generation}} -
\underbrace { \int {\bf q} \cdot {\bf n} \, dS }_{\matrix{Heat \\ Flux}}
\]
We now have two surface integrals of quantities dotted with the unit normal,
\({\bf n}\), to the surface. These are prime candidates for the application
of the divergence theorem to transform them into volume integrals.
Doing so gives
\[
\underbrace { \int \rho \, \dot u \, dV }_{\matrix{Internal \\ Energy}} +
\underbrace { \int \rho \, {\bf a} \cdot {\bf v} \, dV }_{\matrix{Kinetc \\ Energy}} =
\underbrace { \int {\bf f} \cdot {\bf v} \, dV }_{\matrix{Body \\ Forces}} +
\underbrace { \int \nabla \cdot ({\bf v} \cdot \boldsymbol{\sigma}) \, dV }_{\matrix{Surface \\ Forces}} +
\underbrace { \int \dot Q \, dV }_{\matrix{Heat \\ Generation}} -
\underbrace { \int \nabla \cdot {\bf q} \, dV }_{\matrix{Heat \\ Flux}}
\]
And apply the product rule to the term involving surface tractions.
\[
\int \nabla \cdot ({\bf v} \cdot {\boldsymbol{\sigma}}) \, dV =
\int \nabla {\bf v} : {\boldsymbol{\sigma}} \, dV +
\int {\bf v} \cdot ( \nabla \cdot {\boldsymbol{\sigma}}) \, dV
\]
As usual, this step is probably not obvious in matrix notation, but
tensor notation makes it clear.
\[
(v_i \sigma_{ij}),_j = v_i,_j \sigma_{ij} + v_i \sigma_{ij},_{j}
\]
Furthermore, the 1st term on the RHS can be manipulated as follows.
\[
\int \nabla {\bf v} : {\boldsymbol{\sigma}} \, dV \; \rightarrow \;
\int {\bf L} : {\boldsymbol{\sigma}} \, dV \; \rightarrow \;
\int {\boldsymbol{\sigma}} : {\bf L} \, dV \; \rightarrow \;
\int {\boldsymbol{\sigma}} : {\bf D} \, dV +
\int {\boldsymbol{\sigma}} : {\bf W} \, dV \; \rightarrow \;
\int {\boldsymbol{\sigma}} : {\bf D} \, dV
\]
Note here that \(\int {\boldsymbol{\sigma}} : {\bf W} = 0\) because
\( \boldsymbol{\sigma} \) is symmetric and \({\bf W}\) is antisymmetric.
Inserting all this into the summation equation now gives
\[
\underbrace { \int \rho \, \dot u \, dV }_{\matrix{Internal \\ Energy}} +
\underbrace { \int \rho \, {\bf a} \cdot {\bf v} \, dV }_{\matrix{Kinetc \\ Energy}} =
\underbrace { \int {\bf f} \cdot {\bf v} \, dV }_{\matrix{Body \\ Forces}} +
\underbrace { \int \boldsymbol{\sigma} : {\bf D} \, dV +
\int {\bf v} \cdot ( \nabla \cdot \boldsymbol{\sigma}) \, dV }_{\matrix{Surface \\ Forces}} +
\underbrace { \int \dot Q \, dV }_{\matrix{Heat \\ Generation}} -
\underbrace { \int \nabla \cdot {\bf q} \, dV }_{\matrix{Heat \\ Flux}}
\]
We now have all integrals occurring over the control volume,
and furthermore, three of them all involve quantities
being dotted with the velocity vector \({\bf v}\). Group these
three on the RHS and put the rest on the LHS to get.
\[
\underbrace { \int \rho \, \dot u \, dV }_{\matrix{Internal \\ Energy}}
- \underbrace { \int \boldsymbol{\sigma} : {\bf D} \, dV }_{\matrix{Surface \\ Forces}}
- \underbrace { \int \dot Q \, dV }_{\matrix{Heat \\ Generation}}
+ \underbrace { \int \nabla \cdot {\bf q} \, dV }_{\matrix{Heat \\ Flux}}
=
\underbrace { \int {\bf v} \cdot ( \nabla \cdot \boldsymbol{\sigma}) \, dV }_{\matrix{Surface \\ Forces}}
+ \underbrace { \int {\bf f} \cdot {\bf v} \, dV }_{\matrix{Body \\ Forces}}
- \underbrace { \int \rho \, {\bf a} \cdot {\bf v} \, dV }_{\matrix{Kinetc \\ Energy}}
\]
Now group everything together within common volume integrals.
\[
\int \left( \rho \, \dot u - \boldsymbol{\sigma} : {\bf D} - \dot Q + \nabla \cdot {\bf q} \right) dV
=
\int \left( {\bf v} \cdot ( \nabla \cdot \boldsymbol{\sigma}) + {\bf f} \cdot {\bf v} -
\rho \, {\bf a} \cdot {\bf v} \right) dV
\]
And factor the velocity vector, \({\bf v}\), out of each term on the RHS.
\[
\int \left( \rho \, \dot u - \boldsymbol{\sigma} : {\bf D} - \dot Q + \nabla \cdot {\bf q} \right) dV
=
\int \underbrace{ \left( \nabla \cdot \boldsymbol{\sigma} + {\bf f} - \rho \, {\bf a} \right)}_\text{= 0, Equilibrium} \cdot {\bf v} \, dV
\]
As indicated in the equation, the RHS equals zero because it is the equilibrium equation.
If the entire RHS equals zero, and the LHS equals the RHS, then
\[
\int \left( \rho \, \dot u - \boldsymbol{\sigma} : {\bf D} - \dot Q + \nabla \cdot {\bf q} \right) dV = 0
\]
And if the integral always equals 0 for any randomly chosen volume, then the expression inside it must do so too.
\[
\rho \, \dot u - \boldsymbol{\sigma} : {\bf D} - \dot Q + \nabla \cdot {\bf q} = 0
\]
This is the 1st Law of Thermodynamics. It is the equality which dictates that energy is conserved.
It is also written as
\[
\rho \, \dot u = \boldsymbol{\sigma} : {\bf D} + \dot Q - \nabla \cdot {\bf q}
\]
This form shows that internal energy increases as mechanical work is performed
and heat is generated within the control volume, but decreases as heat flows
out.
2nd Law
So why do we even need a 2nd Law of Thermodynamics? The answer is that the
1st Law does have one short coming. While the 1st Law makes sure
that energy is conserved, its weakness is that it would be perfectly
satisfied if a quantity of heat flowed from a cold object to a hot object.
As long as all the heat leaving the cold one arrives at the hot one, then the
1st Law is satisfied. Since this doesn't occur in nature, the 2nd Law
is needed to make sure that heat always flows "down hill".
The 2nd Law of Thermodynamics states that
\[
\underbrace { \int \rho \, \dot s \, dV }_{Entropy}
\; \ge \;
\underbrace { \int {1 \over T} \dot Q \, dV }_{\matrix{Heat \\ Generation}}
\; - \; \underbrace { \int {1 \over T} {\bf q} \cdot {\bf n} \, dS }_{\matrix{Heat \\ Flux}}
\]
where:
\(s\) is entropy per unit mass
\(T\) is absolute temperature
2nd Law Misconceptions
This equation is often misunderstood. Note that it does not say
that \(\dot s \gt 0\), only that \(\dot s\) is (algebraically) greater than
the RHS of the equation. And if more heat is flowing out of the material
than is being generated, then the RHS will be negative. Therefore, the
LHS need only be greater than the negative RHS. It can do this by
either being positive, or by being negative, but less-so than the RHS.
But there is an extra consideration, and it is that while the entropy of
one (or more) objects exchanging heat with each other can decrease,
the total entropy change of all objects involved in the process does
indeed have to be \(\ge 0\). This is easy to see by selecting a control
volume in the above equation that encompasses all objects exchanging
heat with each other. This would mean that no heat flows out of the
control volume, so \( \int {1 \over T} {\bf q} \cdot {\bf n} \, dS = 0\)
and \(\int \rho \, \dot s dV\) for the entire system must
be \(\ge 0\) as a result.
As was done with the 1st Law, apply the divergence theorem to the surface
integral.
\[
\underbrace { \int \rho \, \dot s \, dV }_{Entropy}
\; \ge \;
\underbrace { \int {1 \over T} \dot Q \, dV}_{\matrix{Heat \\ Generation}}
\; - \; \underbrace { \int \nabla \cdot ( {1 \over T} {\bf q}) \, dV }_{\matrix{Heat \\ Flux}}
\]
Now that since all integrals are over volumes, the contents can be extracted to obtain
\[
\rho \dot s \ge {1 \over T} \dot Q - {1 \over T} \nabla \cdot {\bf q} + {1 \over T^2} {\bf q} \cdot \nabla T
\]
At this point, somebody, perhaps Clausius, decided hundreds of years ago that the last term could
be neglected - perhaps because the \(T^2\) in the denominator makes the term small.
Whatever the reason, it is conservative to do so because the
term itself is always negative since \({\bf q} \cdot \nabla T < 0\).
This leaves
\[
\rho \dot s \; \ge \; {1 \over T} \dot Q \; - \; {1 \over T} \nabla \cdot {\bf q}
\]
as the applicable 2nd Law of Thermodynamics. It can also be written as
\[
\rho \, T \dot s \; \ge \; \dot Q \; - \; \nabla \cdot {\bf q}
\]
This can be interpreted simply as \(\rho \, T \dot s \ge \) the net change
of heat energy in a control volume.
Recall that the RHS of the equation also appears in the 1st Law. This permits it
to be swapped out to obtain another interesting relationship.
\[
\rho \, T \dot s \; \ge \; \rho \, \dot u \; - \; \boldsymbol{\sigma} : {\bf D}
\]
Helmholtz Free Energy
The Helmholtz free energy, \(\Psi\), is a combination of two state variables,
internal energy and entropy, multiplied by temperature.
\[
\Psi = u - T s
\]
The time rate of change of the Helmholtz free energy is
\[
\dot \Psi = \dot u - \dot T s - T \dot s
\]
Multiplying through by \(\rho\) gives
\[
\rho \dot \Psi = \rho \, \dot u - \rho \, \dot T s - \rho \, T \dot s
\]
Note how this equation has terms in common with the 2nd Law. It is possible
to combine the two equations to produce
\[
\boldsymbol{\sigma} : {\bf D} \ge \rho \dot \Psi + \rho \, s \, \dot T
\]
Now take two important steps. The first is relatively simple... partition
the rate of deformation tensor into elastic and inelastic constituents.
\[
{\bf D} = {\bf D}^\text{el} + {\bf D}^\text{in}
\]
Only the elastic part generates stress. The inelastic part does not. It
represents permanent deformation that is irreversible. In metals, this
is plastic deformation. It is also present in soils, and to a certain
extent, most any type of material. This leads to
\[
\boldsymbol{\sigma} : {\bf D}^\text{el} + \boldsymbol{\sigma} : {\bf D}^\text{in} \ge \rho \dot \Psi + \rho \, s \, \dot T
\]
The next step is to propose a class of constitutive models for the Helmholtz
free energy. This is done at a very high, nonrestrictive, level. For example,
since \(\Psi\) contains internal energy that includes strain energy, it is
logical that elastic strains should be included, so select the elastic part
of the Green strain tensor, \({\bf E}^\text{el}\). Second, since the internal
energy also contains thermal energy, it should also be dependent on temperature,
\(T\). Finally, introduce a group left-over catch-all variables to account for
all other properties, even unknown ones. Group them in a list (like a vector)
and represent them by \(\boldsymbol{\xi}\). They are called
internal state variables, (ISV), and can be things like dislocation density
in metals, or cross-link density in rubber.
\[
\Psi = \Psi({\bf E}^\text{el}, T, \boldsymbol{\xi})
\]
The rate of change of this is
\[
\dot \Psi =
{\partial \Psi \over \partial {\bf E}^\text{el} }
: \dot {\bf E}^\text{el}
+
{\partial \Psi \over \partial T }
\dot T
+
{\partial \Psi \over \partial \boldsymbol{\xi} }
\cdot \dot{\boldsymbol{\xi}}
\]
Inserting this into the above equation gives
\[
\boldsymbol{\sigma} : {\bf D}^\text{el} + \boldsymbol{\sigma} : {\bf D}^\text{in}
\ge
\rho {\partial \Psi \over \partial {\bf E}^\text{el} }
: \dot {\bf E}^\text{el}
+
\rho {\partial \Psi \over \partial T }
\dot T
+
\rho {\partial \Psi \over \partial \boldsymbol{\xi} }
\cdot \dot{\boldsymbol{\xi}}
+ \rho \, s \, \dot T
\]
Recall that \(\dot{{\bf E}}^\text{el} = {{\bf F}^\text{el}}^T \cdot {\bf D}^\text{el} \cdot {\bf F}^\text{el} \).
Substitute this.
\[
\boldsymbol{\sigma} : {\bf D}^\text{el} + \boldsymbol{\sigma} : {\bf D}^\text{in}
\ge
\rho {\partial \Psi \over \partial {\bf E}^\text{el} }
: \left( {{\bf F}^\text{el}}^T \cdot {\bf D}^\text{el} \cdot {\bf F}^\text{el} \right)
+
\rho {\partial \Psi \over \partial T }
\dot T
+
\rho {\partial \Psi \over \partial \boldsymbol{\xi} }
\cdot \dot{\boldsymbol{\xi}}
+ \rho \, s \, \dot T
\]
And group terms together.
\[
\left( \boldsymbol{\sigma} - \rho \, {\bf F}^\text{el} \cdot {\partial \Psi \over \partial {\bf E}^\text{el}} \cdot {{\bf F}^\text{el}}^T \right)
: {\bf D}^\text{el}
+ \boldsymbol{\sigma} : {\bf D}^\text{in}
\ge
\rho \left( s + {\partial \Psi \over \partial T } \right)
\dot T
+
\rho {\partial \Psi \over \partial \boldsymbol{\xi} }
\cdot \dot{\boldsymbol{\xi}}
\]
And now another big logical leap is required. It is to recognize that anyone could impose
any \({\bf D}^\text{el}\) on any material. So the only way to always satisfy the above
equation is to require that the term equals zero. The way to do this is to require
\[
\boldsymbol{\sigma} = \rho \, {\bf F}^\text{el} \cdot {\partial \Psi \over \partial {\bf E}^\text{el}} \cdot {{\bf F}^\text{el}}^T
\]
But recall from the page on
energetic conjugates
that
\[
\boldsymbol{\sigma} = {1 \over J} \, {\bf F} \! \cdot \boldsymbol{\sigma}^{PK2} \cdot {\bf F}^{T}
\]
And it's clear that
\[
\boldsymbol{\sigma}^\text{PK2} = \rho_o \, {\partial \Psi \over \partial {\bf E}^\text{el} }
\]
So the (amazing) result is that the 2nd Piola-Kirchhoff stress is the partial derivative of the
Helmholtz free energy with respect to the Green strain tensor of the elastic deformations.
(the \(\text{el}\) superscripts can be overlooked for now)
Likewise
\[
\boldsymbol{\sigma} : {\bf D}^\text{in}
\ge
\rho {\partial \Psi \over \partial \boldsymbol{\xi} }
\cdot \dot{\boldsymbol{\xi}}
\qquad \text{and} \qquad
s = - {\partial \Psi \over \partial T }
\]
Linear Elasticity
Recall the equation for the 2nd Piola-Kirchhoff stress.
\[
\boldsymbol{\sigma}^\text{PK2} = \rho_o \, {\partial \Psi \over \partial {\bf E}^\text{el} }
\]
It is the most general form of hyperelasticity. It is exactly the
relationship from which the Mooney-Rivlin model of rubber behavior
is developed by proposing that the Helmholtz free energy is a function
of the stretch ratios. But first, formulate a linearized model.
Do so by performing a linear expansion of the equation.
\[
\boldsymbol{\sigma}^\text{PK2} = \rho_o \, {\partial^2 \Psi \over
\partial {\bf E}^\text{el} \partial {\bf E}^\text{el} } :
{\bf E}^\text{el}
\]
This is written compactly as
\[
\boldsymbol{\sigma}^\text{PK2} = {\bf C} : {\bf E}^\text{el}
\]
where \({\bf C}\) is the 4th rank elastic stiffness tensor.
\[
{\bf C} = \rho_o \, {\partial^2 \Psi \over
\partial {\bf E}^\text{el} \partial {\bf E}^\text{el} }
\]
\({\bf C}\) is 3x3x3x3. For example, \(C_{1233}\) relates strain
\(\epsilon_{33}\) to the shear stress \(\sigma_{12}\). So
\[
\sigma_{12} = \text{.....} + C_{1233} \, \epsilon_{33} + \text{.....}
\]
In general, \({\bf C}\) can relate every strain component to every
stress component. The practical challenge is that one cannot write
a 4th rank 3x3x3x3 tensor on 2-D paper. However, this can be
overcome by using so-called Voigt notation. This amounts to
writing
\[
\left\{ \matrix{
\sigma_{11} \\
\sigma_{22} \\
\sigma_{33} \\
\tau_{12} \\
\tau_{23} \\
\tau_{13} }
\right\}
=
\left[ \matrix{
C_{11} & C_{12} & C_{13} & C_{14} & C_{15} & C_{16} \\
& C_{22} & C_{23} & C_{24} & C_{25} & C_{26} \\
& & C_{33} & C_{34} & C_{35} & C_{36} \\
& & & C_{44} & C_{45} & C_{46} \\
& sym & & & C_{55} & C_{56} \\
& & & & & C_{66} }
\right]
\left\{ \matrix{
\epsilon_{11} \\
\epsilon_{22} \\
\epsilon_{33} \\
\gamma_{12} \\
\gamma_{23} \\
\gamma_{13} }
\right\}
\]
Note that this is not conventional matrix forms that tensor
notation can be applied to. Second, the full shear strain values, \(\gamma_{ij}\), are
used here, not the half values, \(\epsilon_{ij}\). Finally, the order that
shear stresses and strains are listed is not universal. Here they are
listed as \(\gamma_{12}, \gamma_{23}, \gamma_{13}\), but other sources
may also list them as \(\gamma_{12}, \gamma_{13}, \gamma_{23}\).