According to my experience, the most difficult part in understanding C++ is however the most basic one: the expression. Only by having a comprehensive understanding of expression, one can further have a clear understanding about lvalue, rvalue, type deduction, auto keyword, universal reference, decltype, move semantics,etc. As you can see, above-mentioned concepts are at the core of C++ 11 and afterwards.

This blog summarize information, concepts, references around expression. It does not provide additional knowledge, but act as a understanding note.

Expressions

C++ expressions

Each C++ expression (an operator with its operands, a literal, a variable name, etc.) is characterized by two independent properties: a type and a value category. Each expression has some non-reference type, and each expression belongs to exactly one of the three primary value categories: prvalue, xvalue, and lvalue.

Type

An expression can have basic type, user-defined type, const/non-const, reference/non-reference types. However, If an expression initially has the type “reference to T” (8.3.2, 8.5.3), the type is adjusted to T prior to any further analysis., which indicates that expression can have reference types. Expressions can have Reference Type:

Today I got email about some information in Effective Modern C++. The email included the statement, “An expression never has reference type.” This is easily shown to be incorrect, but people assert it to me often enough that I’m writing this blog entry so that I can refer people to it in the future.

Section 5/5 of the Standard is quite clear (I’ve put the relevant text in bold):

If an expression initially has the type “reference to T” (8.3.2, 8.5.3), the type is adjusted to T prior to any further analysis. The expression designates the object or function denoted by the reference, and the expression is an lvalue or an xvalue, depending on the expression.

There’d clearly be no need for this part of the Standard if expressions couldn’t have reference type.

If that’s not enough to settle the matter, consider the type of an expression that consists of a function call. For example:
        int& f();                // f returns int&
        auto x = f();            // a call to f
        
What is the type of the expression “f()”, i.e., the type of the expression consisting of a call to f? It’s hard to imagine anybody arguing that it’s not int&, i.e., a reference type. But what does the Standard say? Per 5.2.2/3 (where I’ve again put the relevant text in bold and where I’m grateful to Marcel Wid for correcting the error I had in an earlier version of this post that referred to 5.2.2/10):

If the postfix-expression designates a destructor (12.4), the type of the function call expression is void; otherwise, the type of the function call expression is the return type of the statically chosen function (i.e., ignoring the virtual keyword), even if the type of the function actually called is different. This return type shall be an object type, a reference type or cv void.

It’s very clear that expressions can have reference type. Section 5/5 takes those expressions and strips the reference-ness off of them before doing anything else, but that’s not the same as the reference-ness never being present in the first place.

Value Category

C++ value_category

With the introduction of move semantics in C++11, value categories were redefined to characterize two independent properties of expressions[5]:

has identity: it’s possible to determine whether the expression refers to the same entity as another expression, such as by comparing addresses of the objects or the functions they identify (obtained directly or indirectly);

can be moved from: move constructor, move assignment operator, or another function overload that implements move semantics can bind to the expression.

In C++11, expressions that:

have identity and cannot be moved from are called lvalue expressions;

have identity and can be moved from are called xvalue expressions;

do not have identity and can be moved from are called prvalue (“pure rvalue”) expressions;

do not have identity and cannot be moved from are not used[6].

The expressions that have identity are called “glvalue expressions” (glvalue stands for “generalized lvalue”). Both lvalues and xvalues are glvalue expressions.

The expressions that can be moved from are called “rvalue expressions”. Both prvalues and xvalues are rvalue expressions.

   ______ ______

  /      X      \

 /      / \      \

|   l  | x |  pr  |

 \      \ /      /

  \______X______/

​      gl    r

Above diagram describes the general relationship between lvalue(l), xvalue(x), prvalue(pr), glvalue(gl), rvalue(r)

Type vs ValueCategory

Lvalueness or rvalueness of an expression is independent of its type, it’s possible to have lvalues whose type is rvalue reference, and it’s also possible to have rvalues of the type rvalue reference. See examples from Universal References in C++11, Scott Meyers:

Widget makeWidget();
 // factory function for Widget
Widget&& var1 = makeWidget()
 // var1 is an lvalue, but
 // its type is rvalue reference (to Widget)
Widget var2 = static_cast<Widget&&>(var1);
 // the cast expression yields an rvalue, but
 // its type is rvalue reference (to Widget)

Note that the valueness of expression static_cast is decided by static_cast itself:

As with all cast expressions, the result is:

An lvalue if target-type is an lvalue reference type or an rvalue reference to function type(since C++11);

A xvalue if target-type is an rvalue reference to object type; [swq: how std::move is implemented] (since C++11)

A prvalue otherwise.

Type Deduction

Deduction context

We consider two occasions where type deduction happens:

Function template parameter type deduction
auto

Additionally, a special kind of type deduction, universal reference is considered.

During compile time compiler has mainly two ways to deduce template parameter types: from user and auto deduction. In the case of user input, whatever user specifies, compiler will use them. If user specified reference, reference collapsing rules apply. Also in C++ 17, class template parameter type can also be deduced: Class template argument deduction (CTAD) (since C++17).

Function template parameter type deduction

Note: Most content of this paragraph comes from the book: Effective Modern C++, Scott Meyers. I only copies them here to make this artical complete.

Take:

template<typename T>
void f(ParamType param);
f(expr); // deduce T and ParamType from expr

// Above pseudo code can represent most cases, since reference, const are not allowed insdie parameter list: template<typename const T> and template<typename T&> and template<typename T&&> are both not valid. However, when user specify T, const and reference types can be used and reference collapsing rules apply.

Note: typename can only be class, or typename, no additional qualifiers are possible: type-parameter-key -either typename or class

Case 1: ParamType is a Reference or Pointer, but not a Universal Reference
- If expr’s type is a reference, ignore the reference part
- Then pattern-match expr’s type against ParamType to determine T.
Case 2: ParamType is a Universal Reference
- If expr is an lvalue, both T and ParamType are deduced to be lvalue references. That’s doubly unusual. First, it’s the only situation in template type deduction where T is deduced to be a reference. Second, although ParamType is declared using the syntax for an rvalue reference, its deduced type is an lvalue reference.
- If expr is an rvalue, the “normal” (i.e., Case 1) rules apply.
Case 3: ParamType is Neither a Pointer nor a Reference
- As before, if expr’s type is a reference, ignore the reference part
- If, after ignoring expr’s reference-ness, expr is const, ignore that, too. If it’s volatile, also ignore that.

auto deduction

It’s essentially the same as function template parameter type deduction like above, the mappings relationships are:

auto –> T
Expression before = –> ParamType
Expression after = –> expr

For example:

auto x = 27; // case 3 (x is neither ptr nor reference)
const auto cx = x; // case 3 (cx isn't either)
const auto& rx = x; // case 1 (rx is a non-universal ref.)
// T --> auto; const auto& --> ParamType; x --> expr

auto can also be used in lamda and support universal reference:

#include <algorithm>
#include <iostream>
#include <iterator>
#include <utility>
#include <vector>

class A {
public:
  // Default constructor
  A() { std::cout << "Default constructor called\n"; }

  // Destructor
  ~A() { std::cout << "Destructor called\n"; }

  // Copy constructor
  A(const A &) { std::cout << "Copy constructor called\n"; }

  // Move constructor
  A(A &&) noexcept { std::cout << "Move constructor called\n"; }

  // Copy assignment operator
  A &operator=(const A &) {
    std::cout << "Copy assignment operator called\n";
    return *this;
  }

  // Move assignment operator
  A &operator=(A &&) noexcept {
    std::cout << "Move assignment operator called\n";
    return *this;
  }
};

int main() {

  std::vector<A> vec(2);
  std::for_each(vec.begin(), vec.end(),
                // e is of lvalue, copy contructor called
                // Note: e is not expression here, decltype get the real type of
                // e, for universal reference, it's either non-reference type or
                // lvalue reference type
                [](auto &&e) { auto a = std::forward<decltype(e)>(e); });

  std::for_each(std::make_move_iterator(vec.begin()),
                std::make_move_iterator(vec.end()),
                // e is of xvalue, move contructor called
                // Note: e is not expression here, decltype get the real type of
                // e, for universal reference, it's either non-reference type or
                // lvalue reference type
                [](auto &&e) { auto a = std::forward<decltype(e)>(e); });
}

More about Universal Reference

Universal References in C++11, Scott Meyers

Reference Collapsing Rules

C++ Reference

An rvalue reference to an rvalue reference becomes (‘collapses into’) an rvalue reference.
All other references to references (i.e., all combinations involving an lvalue reference) collapse into an lvalue reference.

Key Points & Golden Rules

Remember that “&&” indicates a universal reference only where type deduction takes place. Where there’s no type deduction, there’s no universal reference. In such cases, “&&” in type declarations always means rvalue reference.
Apply std::move to rvalue references and std::forward to universal references
Only use std::forward with universal references
Universal reference type deduction is the only situation a template parameter is deduced as reference(when passed type is of lvalue).

std::forward explained

std::forward

Only use std::forward with universal reference. So the template argument for it should always be deduced, instead of specified.
- This implies that the deduced type T is either non-reference type or a lvalue reference.
- Always use universal reference template paramter T directly for std::forward’s template paramters, do not manually add additional const/reference to it. Note that T can be decorated: like forward<decltype(forward<T>(arg).get())>, as long as T itself is the same as the universal reference.
std::forward is an expression and function, it’s value category conform to normal C++ expression rules. It always return reference:
- When return lvalue reference, its value category is lvalue
- When return rvalue reference, its value category is rvalue(xvalue or prvalue)
- Above behavior is due to the fact that when cast to rvalue reference, the result of static_cast is of xvalue; when cast to non-reference, the result is of prvalue; when cast to lvalue reference, the result is of lvalue.
The type and valueness is decided by static_cast, which is the internal implementation of std::forward

The standard implements std::forward:

  /**
   *  @brief  Forward an lvalue.
   *  @return The parameter cast to the specified type.
   *
   *  This function is used to implement "perfect forwarding".
   */
  template<typename _Tp>
    constexpr _Tp&&
    forward(typename std::remove_reference<_Tp>::type& __t) noexcept
    { return static_cast<_Tp&&>(__t); }

  /**
   *  @brief  Forward an rvalue.
   *  @return The parameter cast to the specified type.
   *
   *  This function is used to implement "perfect forwarding".
   */
  template<typename _Tp>
    constexpr _Tp&&
    forward(typename std::remove_reference<_Tp>::type&& __t) noexcept
    {
      static_assert(!std::is_lvalue_reference<_Tp>::value, "template argument"
		    " substituting _Tp is an lvalue reference type");
      return static_cast<_Tp&&>(__t);
    }

_Tp is the type deduced from universal references, which might be non-reference type or lvalue reference type. __t is the parameter passed to std::forward, which is of lvalue category, since inside function, parameters are passed always as lvalues. The aim of std::forward is to restore the value category of the passed parameter:

When _Tp is of non-reference type, which means the universal reference is deduced based on passed rvalue parameter:
- __t is of _Tp&& type, the second overload is used, std::forward return rvalue category, rvalue reference type object
When _Tp is of lvalue reference type, which means the universal reference is deduced based on passed lvalue parameter:
- __t is of _Tp& type, the first overload is used, std::forward return lvalue category, lvalue reference type object

Note: Type is not changed when parameters are passed down to nested function calls, only value category are changed. This is how std:forward do overload based on types

Usage1: Forwards lvalues as either lvalues or as rvalues, depending on T:

template<class T>
void wrapper(T&& arg)
{
    // arg is always lvalue
    foo(std::forward<T>(arg)); // Forward as lvalue or as rvalue, depending on T
}

When T is deduced to non-reference type,the expression std::forward is of:
- type: rvalue reference to T
- value category: rvalue
When T is deduced to lvalue reference type, the expression std::forward is of:
- type: lvalue reference to T
- value category: lvalue

Note: This usage is to restore the value category of the passed parameter, whose valueness, due to the nested argument passing, is all treated as lvalue category.

Usage2: Forwards rvalues as rvalues and prohibits forwarding of rvalues as lvalues.

struct Arg
{
    int i = 1;
    int  get() && { return i; } // call to this overload is rvalue
    int& get() &  { return i; } // call to this overload is lvalue
};
// transforming wrapper
template<class T>
void wrapper(T&& arg)
{
    foo(forward<decltype(forward<T>(arg).get())>(forward<T>(arg).get()));
}

The decltype(forward<T>(arg).get()) evalues the type:
- If T is deduced to non-reference type, the expression forward<T>(arg).get() will have type int and have xvalueness. decltype will result in type int&&
- If T is deduced to lvalue reference type, the expression forward<T>(arg).get() will have type int& and have lvalueness. decltype will result int type int&
The second appearance of forward<T>(arg).get() just call proper implementation of get()

Note: when using decltype as parameter argument to std::forward, we only need to care about whether the expression in decltype is lvalue or rvalue, no need to care about the type(only need to know the non-reference type of T). Since if the expression is lvalue, decltype will result in lvalue reference type, which result in lvalue reference type(lvalueness) for the std::forward. Otherwise, if the expression inside decltype is of xvalue or prvalue, decltype results in non-reference type or rvalue reference type, which both result in a rvalue reference type(rvalueness) for the std::forward expression. decltype and std::forward together to pass the valueness down to nested function calls.

Note: If we use foo(forward<int&>(forward<T>(arg).get())); and pass a rvalue of Arg instance to wrapper, there will be compile time error, since we try to forward rvalue as lvalue(the valueness of the return of std::forward).

std::forward can be used together with auto&& in lambdas, which needs the help of decltype to infer the correct type:

  std::vector<std::string> source = {"a", "b", "c"};

  //   This is totally generic
  auto forwarder = [](auto &&container) {
    // If container is of lvalue, it's type is lvalue reference type, decltype
    // return lvalue reference.
    // If container is of rvalue, it's type is rvalue reference type, decltype
    // return rvalue reference
    auto dest = std::forward<decltype(container)>(container);
  };

  forwarder(std::move(source));
  std::cout << source.size() << std::endl;

  std::vector<std::string> source2 = {"x", "y", "z"};
  forwarder(source2);
  std::cout << source2.size() << std::endl;

However, using of std::forward in range-based for loop with auto&& will not work as expected:

  std::vector<std::string> source = {"a", "b", "c"};

  //   item is std::string& type, even with std::move
  for (auto &&item : std::move(source)) {
    // After std::forward the return  type is still std::string&
    auto dst = std::forward<decltype(item)>(item);
  }
  std::cout << source[0] << std::endl;

In above code snippet, std::move(source) will not trigger move operation for source and item is of lvalue category and lvalue reference type. This behavior can be explained by looking at the internal implementation of range-based for in C++:

{

auto&& /* range */ = range-initializer ﻿;
for (auto /* begin */ = /* begin-expr */, /* end */ = /* end-expr */;
/* begin */ != /* end */; ++/* begin */)
{
item-declaration = */* begin */;
statement
}
}

First the auto&& universal reference is used to create a local variable, which is itself a lvalue
Then use iterator to iterate over this lvalue and dereference this lvalue and create a local variable using item-declaration, which is the user specified auto &&item
auto &&item will always be deduced as lvalue reference
If auto item is used, item will be a copy, and the original const qualifier will be removed

To iterate and move element out of a container, move iterators can be used:

  std::vector<std::string> source = {"a", "b", "c"};

  //   This will move the element
  for (auto it = std::make_move_iterator(source.begin());
       it != std::make_move_iterator(source.end()); ++it) {
    auto dst = *it;
  }
  std::cout << source[0] << std::endl;

Named Variables

Named variables and parameters of rvalue reference type are lvalues. Also from Universal References in C++11, Scott Meyers:

template<typename T>
class Widget {
 ...
 Widget(Widget&& rhs);
 // rhs's type is rvalue reference, but rhs
 // itself is an lvalue
 ...
};
template<typename T1>
class Gadget {
 ...
 template <typename T2>
 Gadget(T2&& rhs);
 // rhs is a universal reference whose type will
 // eventually become an rvalue reference or an
 // lvalue reference, but rhs itself is an lvalue
 ...
};

To create rvalue, std::move, std::forward, static_cast specifier has to be used:

#include <iostream>
#include <utility>
class A {
public:
  // Default constructor
  A() { std::cout << "Default constructor called\n"; }

  // Destructor
  ~A() { std::cout << "Destructor called\n"; }

  // Copy constructor
  A(const A &) { std::cout << "Copy constructor called\n"; }

  // Move constructor
  A(A &&) noexcept { std::cout << "Move constructor called\n"; }

  // Copy assignment operator
  A &operator=(const A &) {
    std::cout << "Copy assignment operator called\n";
    return *this;
  }

  // Move assignment operator
  A &operator=(A &&) noexcept {
    std::cout << "Move assignment operator called\n";
    return *this;
  }
};

int main() {

  A a;
  A &&b = std::move(a);

  // ! This will call copy constructor, NOT move constructor, even b is rvalue
  // reference type, but it's lvalue
  A c(b);

  // ! This will call move constructor
  A d(std::move(c));
}

decltype

C++ decltype

Inspects the declared type of an entity or the type and value category of an expression. This implies two funtionality of decltype:

When used as decltype ( entity ), where entity is unparenthesized id-expression or class memeber expression, it yields the type of entity
When used as decltype ( expression ), where expression is any other expression, it inspects the expression’s value type(eg,T) and value category and yields following types accordingly:

if value category of expression is xvalue, it yields T&&
if value category of expression is lvalue, it yields T&
if value category of expression is prvalue, it yields T

Note: if variable id-expression or class memeber access expression is parenthesized, it is treated as ordinary lvalue expression(which is reasonable, because named variables are always lvalue expressions)

class Widget {};
Widget makeWidget() { return Widget(); }

int main() {

  Widget &&var1 = makeWidget();
  // var1 is an lvalue, but
  // its type is rvalue reference (to Widget)
  Widget var2 = static_cast<Widget &&>(var1);
  // the cast expression yields an rvalue, but
  // its type is rvalue reference (to Widget)
  decltype(static_cast<Widget &&>(var1)) var3 = makeWidget();
  //   expression type is Widget &&, value category of expression is xvalue,
  //   first get the non-reference type Widget, so var3 is of Widget&& type.
  Widget &var4 = var2;
  decltype(std::move(var4)) var5 = makeWidget();
  // expression type is Widget &, value category of expression is xvalue; first
  // get the non-reference type Widget, so var5 is of Widget && type

  decltype((var4)) var6 = var2;
  // expression type is Widget &, value category of expression is lvalue; first
  // get the non-reference type Widget, so var6 is of Widget & type

  decltype(makeWidget()) var7 = makeWidget();
  // expression type is Widget , value category of expression is prvalue; first
  // get the non-reference type Widget, so var6 is of Widget  type
}

Note: When doing all deduction the expression type T will use the non-reference version, since as the standard says:

If an expression initially has the type “reference to T” (8.3.2, 8.5.3), the type is adjusted to T prior to any further analysis.

decltype links valueness of an expression to type.

Print valueness

First approach, we can check whether the yield type of decltype(expression) is lvalue or rvalue reference, if rvalue reference, the expression is xvalue; if lvalue reference, the expression is lvalue; otherwise, the expression is prvalue

#include <iostream>
#include <utility> // for std::move

class Widget {};
Widget makeWidget() { return Widget(); }

template <typename T> struct value_category {
  static constexpr const char *str() { return "prvalue"; }
};

template <typename T> struct value_category<T &> {
  static constexpr const char *str() { return "lvalue"; }
};

template <typename T> struct value_category<T &&> {
  static constexpr const char *str() { return "xvalue"; }
};

// Macro to check the value category of an expression
#define PRINT_VALUE_CATEGORY(expr)                                             \
  std::cout << "The expression '" #expr "' is a "                              \
            << value_category<decltype((expr))>::str() << std::endl;

int main() {
  Widget &&var1 = makeWidget();
  Widget var2 = static_cast<Widget &&>(var1);
  Widget &var4 = var2;

  PRINT_VALUE_CATEGORY(var1);                         // lvalue
  PRINT_VALUE_CATEGORY(static_cast<Widget &&>(var1)); // xvalue
  PRINT_VALUE_CATEGORY(var4);                         // lvalue
  PRINT_VALUE_CATEGORY((var4));                       // lvalue
  PRINT_VALUE_CATEGORY(std::move(var4));              // xvalue
  PRINT_VALUE_CATEGORY(makeWidget());                 // prvalue
}

static_cast vs decltype

Let’s compare static_cast and decltype:

With static_cast:

As with all cast expressions, the result is:

An lvalue if target-type is an lvalue reference type or an rvalue reference to function type(since C++11);

A xvalue if target-type is an rvalue reference to object type; [swq: how std::move is implemented] (since C++11)

A prvalue otherwise.(non-reference type)

With decltype:

if value category of expression is xvalue, it yields T&&

if value category of expression is lvalue, it yields T&

if value category of expression is prvalue, it yields T

They do reverse operations:

static_cast sets the type/value category (via the target type).
decltype infers the type/value category (from the expression).

declval

C++ declval

declval can return a reference to a type, without going through actual construction.

Implementation:

template<typename T>
typename std::add_rvalue_reference<T>::type declval() noexcept
{
    static_assert(false, "declval not allowed in an evaluated context");
}

What declval does is simply add rvalue reference to type T
The static_cast statement make sure that it can only be used in unevaluated context, like inside decltype

Why add rvalue reference, instead of lvalue reference?

The reason is related to reference collapsing rules: only by adding rvalue reference, declval might have the possibility return a rvalue reference, so as to have more possibility to call methods, such as methods that can only be called by rvalue.