GPAW: An open Python package for electronic-structure calculations

Jens Jørgen Mortensen Corresponding author: jjmo@dtu.dk CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Ask Hjorth Larsen CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Mikael Kuisma CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Aleksei V. Ivanov Riverlane Ltd, St Andrews House, 59 St Andrews Street, Cambridge, CB2 3BZ, United Kingdom Alireza Taghizadeh CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Andrew Peterson School of Engineering, Brown University, Providence, Rhode Island, USA Anubhab Haldar Department of Electrical and Computer Engineering, Boston University, United States of America. Asmus Ougaard Dohn Department of Physics, Technical University of Denmark, 2800 Lyngby, Denmark; Science Institute and Faculty of Physical Sciences, VR-III, University of Iceland, Reykjavík 107, Iceland Christian Schäfer Department of Physics, Chalmers University of Technology, SE-412 96 Gothenburg, Sweden Elvar Örn Jónsson Science Institute and Faculty of Physical Sciences, University of Iceland, VR-III, 107 Reykjavík, Iceland Eric D. Hermes Quantum-Si, 29 Business Park Drive, Branford, CT 06405, USA Fredrik Andreas Nilsson CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Georg Kastlunger CatTheory, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Gianluca Levi Science Institute and Faculty of Physical Sciences, University of Iceland, VR-III, 107 Reykjavík, Iceland Hannes Jónsson Science Institute and Faculty of Physical Sciences, University of Iceland, VR-III, 107 Reykjavík, Iceland Hannu Häkkinen Departments of Physics and Chemistry, Nanoscience Center, University of Jyväskylä, FI-40014 Jyväskylä, Finland Jakub Fojt Department of Physics, Chalmers University of Technology, SE-412 96 Gothenburg, Sweden Jiban Kangsabanik CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Joachim Sødequist CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Jouko Lehtomäki Department of Applied Physics, Aalto University, P.O. Box 11100, 00076 Aalto, Finland Julian Heske CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Jussi Enkovaara CSC – IT Center for Science Ltd., P.O. Box 405, FI-02101 Espoo, Finland Kirsten Trøstrup Winther SUNCAT Center for Interface Science and Catalysis, SLAC National Accelerator Laboratory, Menlo Park, California 94025, United States Marcin Dulak CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Marko M. Melander Department of Chemistry, Nanoscience Center, University of Jyväskylä, FI-40014 Jyväskylä, Finland Martin Ovesen CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Martti Louhivuori CSC – IT Center for Science Ltd., P.O. Box 405, FI-02101 Espoo, Finland Michael Walter University of Freiburg, FIT Freiburg Centre for Interactive Materials and Bioinspired Technologies, Georges-Köhler-Allee 105, 79110 Freiburg, Germany Morten Gjerding CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Olga Lopez-Acevedo Biophysics of Tropical Diseases, Max Planck Tandem Group, University of Antioquia UdeA, 050010 Medellin, Colombia Paul Erhart Department of Physics, Chalmers University of Technology, SE-412 96 Gothenburg, Sweden Robert Warmbier School of Physics and Mandelstam Institute for Theoretical Physics, University of the Witwatersrand, 1 Jan Smuts Avenue, 2001, Johannesburg, South Africa Rolf Würdemann Freiburger Materialforschungszentrum, Universität Freiburg, Stefan-Meier-Straße 21, D-79104 Freiburg, Germany Sami Kaappa Computational Physics Laboratory, Tampere University, P.O. Box 692, FI-33014 Tampere, Finland Simone Latini Nanomade, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Tara Maria Boland CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Thomas Bligaard Department of Energy Conversion and Storage, Technical University of Denmark, DK-2800 Lyngby, Denmark Thorbjørn Skovhus CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Toma Susi University of Vienna, Faculty of Physics, Boltzmanngasse 5, 1090 Vienna, Austria Tristan Maxson Department of Chemical and Biological Engineering, The University of Alabama, Tuscaloosa, AL 35487, USA Tuomas Rossi CSC – IT Center for Science Ltd., P.O. Box 405, FI-02101 Espoo, Finland Xi Chen School of Physical Science and Technology, Lanzhou University, Lanzhou, Gansu 730000, China Yorick Leonard A. Schmerwitz Science Institute and Faculty of Physical Sciences, University of Iceland, VR-III, 107 Reykjavík, Iceland Jakob Schiøtz CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Thomas Olsen CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Karsten Wedel Jacobsen CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark Kristian Sommer Thygesen CAMd, Department of Physics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark

(May 1, 2024)

Abstract

We review the GPAW open-source Python package for electronic structure calculations. GPAW is based on the projector-augmented wave method and can solve the self-consistent density functional theory (DFT) equations using three different wave-function representations, namely real-space grids, plane waves, and numerical atomic orbitals. The three representations are complementary and mutually independent and can be connected by transformations via the real-space grid. This multi-basis feature renders GPAW highly versatile and unique among similar codes. By virtue of its modular structure, the GPAW code constitutes an ideal platform for implementation of new features and methodologies. Moreover, it is well integrated with the Atomic Simulation Environment (ASE) providing a flexible and dynamic user interface. In addition to ground-state DFT calculations, GPAW supports many-body GW band structures, optical excitations from the Bethe–Salpeter Equation (BSE), variational calculations of excited states in molecules and solids via direct optimization, and real-time propagation of the Kohn–Sham equations within time-dependent DFT. A range of more advanced methods to describe magnetic excitations and non-collinear magnetism in solids are also now available. In addition, GPAW can calculate non-linear optical tensors of solids, charged crystal point defects, and much more. Recently, support of GPU acceleration has been achieved with minor modifications of the GPAW code thanks to the CuPy library. We end the review with an outlook describing some future plans for GPAW.

I Introduction

The electronic-structure (ES) problem, i.e. the solution to the time-independent Schrödinger equation for a collection of electrons and atomic nuclei, forms the starting point for the quantum-mechanical treatment of matter. Indeed, all chemical and physical properties of any substance (solid, molecule, surface, etc.) can in principle be obtained from the energies and wave functions that constitute the solution. A pioneering step towards solving the many-body ES problem was the formulation and formal proof of density functional theory (DFT) by Hohenberg and Kohn in 1964 [1], and a practical scheme for its solution by Kohn and Sham in 1965 [2]. Today, most codes solving the ES problem from first principles are based on DFT. Such codes are extremely powerful and allow one to determine the atomic structure of solids and molecules containing hundreds of atoms with a relative error below 1% [3, 4, 5]. Once the atomic structure of the compound has been solved, its properties (electronic, magnetic, optical, topological, etc.) can in principle be determined. The evaluation of properties often involves theories beyond the formal DFT framework to account for effects such as temperature and lattice vibrations [6, 7], many-body interactions in excited states [8, 9], or time dependence [10, 11]. As such, first-principles atomistic calculations often involve two successive phases: the solution of the ground-state ES problem (including ion dynamics) and the subsequent evaluation of physical properties. This review is structured accordingly as Secs. III-IV deal with the first phase while Secs. V-IX are devoted to the second.

In recent years, the scientific significance of ES codes has shifted from a useful tool to describe and understand matter at the atomic scale to an independent driver of the discovery and development of new materials [12, 13, 14, 15, 16]. This change in scope has been fueled by the exponential increase in computer power accompanied by improved numerical algorithms [17, 18] as well as the use of workflow management software for high-throughput computations [19, 20, 21, 22] and the adoption of machine-learning techniques to leverage the rapidly growing data generated by ES codes [23, 24, 25]. In parallel with these capacity-extending developments, continuous progress in the fundamental description of exchange–correlation effects has advanced the predictive power of ES calculations to a level where they rival experiments in terms of accuracy for many important properties [26, 27, 28, 29, 30, 31].

The GPAW code was originally intended as a Python-based multigrid solver of the basic DFT equations within the projector-augmented wave (PAW) formalism [32]. The name GPAW accordingly was an abbreviation for “grid-based projector-augmented waves”. Today, other choices than regular grids for representations of the wave functions exist in GPAW, but the name has stuck. During the years 2005–2010, GPAW evolved to a full-blown DFT package [33] supporting most of the functionality expected from a modern ES code, in addition to a few more specialised features including real-time propagation of wave functions [34] and an alternative basis of numerical atomic orbitals (referred to as the LCAO basis) [35] to supplement the real-space grid. In 2011, a plane-wave (PW) basis set was also implemented. Today, the possibility to use three different types of basis sets and even combining them within a single run remains a unique feature of GPAW, rendering the code very versatile.

The implementation of the PW basis set laid the groundwork for GPAW’s linear-response module, which today supports the calculation of linear response functions [36], total energies from the adiabatic connection fluctuation-dissipation theorem [29, 37], the GW self-energy method for quasiparticle band structures [38], the Bethe–Salpeter Equation (BSE) for optical excitations [39], and more. The code also supports a wide range of features related to the calculation of magnetism and spin–orbit effects. Examples include spin-spiral calculations using the generalized Bloch theorem [40], external magnetic fields, orbital magnetization, magnetic anisotropy [41], adiabatic magnon dispersions from the magnetic force theorem [42], and dynamic magnetic response from TDDFT [43]. For solids, the k-space Berry phases can be computed directly from the Bloch orbitals and may be used to obtain the spontaneous polarization [44], Born effective charges, piezoelectric response tensors [45] and various indices characterising the band topology [46].

In addition, GPAW can compute the localisation matrices forming the basis for the construction of Wannier functions with e.g. the Atomic Simulation Environment (ASE) [47] or Wannier90 [48]. Electrostatic corrections to the formation energies of charged point defects in insulators are implemented as are calculations of the hyperfine coupling and zero-field splitting for localised electron spins. GPAW also offers the possibility to perform time-independent, variational calculations of localised electronic excitations, in e.g. molecules or at crystal point defects, using direct orbital optimisation strategies implemented for all three types of basis sets [49, 50, 51]. This provides an efficient and robust alternative to traditional “ $\Delta$ SCF” approaches. GPAW can also be used to describe ultrafast electron dynamics within time-dependent density functional theory (TDDFT) with wave functions represented either on a real space grid [34] or in the LCAO basis [52]. The latter can provide a significant speed-up due to the relatively small size of the basis [53, 54, 55]. The LCAO representation also forms the basis for calculation of electron–phonon couplings as well as non-linear optical spectra such as Raman scattering [56] (which can alternatively be obtained in the PW mode as a finite difference of the dielectric tensor), second-harmonics generation [57], and shift currents [58] using higher-order perturbation theory.

II Why GPAW?

II.1 User’s perspective

There are dozens of electronic-structure codes available for the interested user. The codes differ in their license (in particular, whether open or proprietary), the underlying programming language (e.g. Fortran, C, Python), their treatment of core electrons (all-electron versus pseudopotentials), the employed representations of the wave functions (plane waves, atom-centered orbitals, real-space grids), and the beyond-DFT features they support. Why should one choose GPAW?

In this section, we describe some of the features that make GPAW interesting from the point of view of a common user who wants to perform electronic-structure calculations. The next section focuses on its possibilities for more advanced users, who perhaps want to modify the code or implement completely new functionalities.

A first point to note is that GPAW is written almost exclusively in Python and is directly integrated with the Atomic Simulation Environment. This integration with ASE makes the setup, control, and analysis of calculations easy and flexible. The programming language is of course a key issue for developers, but also the common user benefits from Python and the ASE/GPAW integration. A typical stand-alone program only offers a fixed (though of course possibly large) set of tasks that it can perform, while Python scripting allows for a more flexible use of the code. This could for example mean combining several different GPAW calculations in new ways. Another advantage is that “inner parts” of the code like the density or the Kohn–Sham eigenvalues are directly accessible in a structured format within Python for further analysis. It is even possible to “open up the main loop” of GPAW and have access to, inspect, and also modify key quantities during program execution (see Fig. 1).

As already mentioned in the introduction, GPAW distinguishes itself from other available ES codes by supporting three different ways of representing the wave functions. The most commonly used basis set is plane waves (PW), which is appropriate for small or medium-size systems, where high precision is required. Convergence is easily and systematically controlled by tuning the cut-off energy. A large number of advanced features and “beyond-DFT” methods are available in the PW mode. These include the calculation of hybrid functionals, RPA total energies, linear-response TDDFT, and many-body perturbation theory techniques like GW and the Bethe–Salpeter equations. The new GPU implementation also uses the PW mode.

The wave functions can alternatively be represented on real-space grids, which was the original approach in GPAW. The implementation of this so-called finite-difference (FD) mode relies on multi-grid solutions of the Poisson and Kohn–Sham equations. The FD mode allows for more flexible boundary conditions than the PW mode, which is restricted to periodic supercells. The boundary conditions may for example be taken to reflect the charge distribution in the unit cell. Calculations in the FD mode can be systematically converged through lowering of the grid spacing, but the approach to full convergence is slower than in the PW mode. The FD mode is particularly well suited for large systems because the wave-function representation allows for large-scale parallelization through real-space decomposition. Furthermore, it is possible to perform time-propagation TDDFT including Ehrenfest dynamics in this mode.

The third representation of the wave functions is a basis of numerical atom-centered orbitals in the linear combination of atomic orbitals (LCAO) mode. The size of the basis set can be varied through inclusion of more angular momentum channels, additional orbitals within a channel, or polarization functions. GPAW comes with a standard set of orbitals, but a basis-set generator is included with the code so that users may construct different basis sets depending on their needs and requirements. The LCAO mode is generally less accurate than the PW and FD modes, but it allows for the treatment of considerably larger systems – more than ten thousand atoms. It is also possible to study electron dynamics through a fast implementation of time-propagation DFT, and Ehrenfest dynamics is under development.

As explained, the different modes have different virtues and limitations, and it can therefore be an advantage to apply several modes in a project. For larger systems, it is for example possible to divide a structure optimization into two steps. First, an optimization is performed with the fast LCAO basis leading to an approximately correct structure. This is then followed by an optimization in either the PW or FD mode, which now requires much fewer steps because of the good initial configuration. Due to the ASE/Python interface this combined calculation can easily be performed within a single script.

Since GPAW was originally created with the FD mode only, and the LCAO mode was added next, some features have been implemented for only those modes. Examples are real-time TDDFT (see section VII) and electron–phonon coupling (see section VI.7). Conversely, some new features only work for the PW mode, which was added after the real-space modes. Examples are RPA total energies (see section VI.3) and calculation of the stress tensor. To summarize, given the limitations just mentioned, users should most of the time use PW or LCAO mode and the choice will depend on the accuracy needed and the resources available.

II.2 Developer’s perspective

The GPAW source code is written in the Python and C languages and is hosted on GitLab [59] licensed under the GNU General Public License v3.0. This ensures the transparency of all features and allows developers to fully customise their experience and contribute new features to the community.

An advantage of having a Python code is that the Python script you write to carry out your calculations will have access to everything inside a GPAW calculation. An example showing the power and flexibility this affords is the possibility to have user-code inserted inside the self-consistent field (SCF) loop as demonstrated in Fig. 1.

Refer to caption — Figure 1: The variable calc is the ground-state DFT calculator object and its icalculate method yields a context object at every self-consistent field (SCF) step. As seen, one can use this in a for-loop to implement special logic for termination of the SCF iterations or for diagnostics. In this example, the memory usage is written to the log-file for the first 15 SCF iterations.

At the time of this writing (July 2023), GPAW has two versions of the ground-state DFT code in the main branch of the code. There is the older version that has grown organically since the birth of GPAW: it has many features, but also a lot of technical debt that makes it harder to maintain and less ideal to build new features on top of. The newer ground-state code addresses these issues by having a better overall design.

The new design greatly improves the ease of implementation of new features. The goal is to have the new code be feature-complete so that it can pass the complete test suite and then delete the old code once that is achieved. At the moment, we recommend that all production calculations are done with the old code and that work on new features is done on top of the new code even though certain features are not yet production-ready. Three new features, not present in the old code base, have already been implemented based on the new code: GPU implementation of PW mode calculations (see section III.2.9), reuse of the wave functions after unit-cell changes during cell optimization, and spin-spiral calculations (see section V.5).

GPAW uses pytest [60] for its test suite that currently consists of approximately 1600 unit and integration tests (see Table 1). A subset of those tests runs as part of GitLab’s continuous integration (CI) thereby checking the correctness of every code change. Unfortunately, the full test suite is too time-consuming to run as part of CI, so we run that nightly both in serial as well as in parallel using MPI.

Many of the code examples in GPAW’s documentation, exercises and tutorials [61] require resources (time and number of CPUs) beyond what would make sense to run as part of the pytest test suite. For that, we use MyQueue [21] to submit those scripts as jobs to a local supercomputer every weekend. At the moment this amounts to approximately 5200 core-hours of calculations.

As can be seen from Table 1, the majority of the code is written in Python, which is an interpreted language that is easy to read, write and debug.

Table 1: Number of files and number of lines of code in the git repository of GPAW. The Python source-code files are split into three parts: the actual code, the test suite, and code examples in the documentation.

Type of file	Files	Lines
Python (the code)	513	146,604
C	80	19,719
Python (test suite)	681	47,147
Python (documentation)	744	32,014

Interpreter-executed code will not run as efficiently as code that is compiled to native machine code. It is therefore important to make sure that the places in the code where most of the time is spent (hot spots) are in native machine code and not in the interpreter. GPAW achieves this by implementing the hot spots in C-code with Python wrappers that can be called from the Python code. Examples of such computationally intensive tasks are applying a finite-difference stencil to a uniform grid, interpolating from one uniform grid to another, or calculation of overlaps between projector functions and wave functions. In addition, we have Python interfaces to the numerical libraries FFTW [62], ScaLAPACK [63], ELPA [17], BLAS, Libxc [64, 65], libvdwxc [66], and MPI. Finally, GPAW makes heavy use of the NumPy [67] and Scipy [68] Python packages. NumPy provides us with the numpy.ndarray data type which is an $N$ -dimensional array that we use for storing wave functions, electron densities, potentials, matrices like the overlap matrix or the LCAO wave function coefficients, and much more. The use of NumPy arrays allows us to use the many sub-modules of SciPy to manipulate data. This also gives us an efficient memory layout allowing us to simply pass a pointer to the memory whenever we need to call the C-code from the Python code. With this strategy, we can get away with having most of the code written in a relatively slow interpreted language and still have most of the time spent in highly efficient C-code or optimized numerical libraries.

The advantage of the original FD-mode, where there are no Fourier transforms of the wave functions, is that the algorithms should parallelize well for large systems. In practice, it has turned out that FD-mode has a number of disadvantages: 1) Due to integrals over the unit cell being done as sums over grid-points, there will be a small periodic energy variation as you translate atoms and the period of the variation will be equal to the grid-spacing used (the so-called egg-box error); 2) The system sizes that are typically most interesting for applications of DFT are too small for the parallel scalability to be the decisive advantage; 3) The memory used to store the wave functions on uniform grids in real-space is significant. In contrast, the PW mode has practically no egg-box error, is very efficient for the most typical system sizes and often uses a factor of 10 less memory compared to an FD mode calculation of similar accuracy. The main advantages for LCAO mode are low memory usage and high efficiency for large systems; for small unit cells with many $\mathbf{k}$ -points the PW mode is most efficient. One disadvantage of LCAO-mode is egg-box errors: LCAO-mode uses the same uniform grids as used in FD-mode for integration of matrix elements like $\langle\Phi_{\mu}|\tilde{v}|\Phi_{\nu}\rangle$ and therefore has similar egg-box energy variation. A second disadvantage of LCAO-mode is that as for any localized basis-set, reaching the complete basis-set limit is more involved compared to PW and FD modes. This can have severe consequences even for ground state calculations of difficult systems such as Cr₂ for example[wurdemann_density_2015]. In PW or FD modes the complete basis set limit is easy to reach by simply increasing the number of plane-waves or grid-points, respectively, which leads to a smooth convergence[32]. At the moment we only provide double- $\zeta$ polarized (DZP) basis sets and going beyond DZP is left for users to do themselves.

III Ground-state DFT

III.1 Projector augmented-wave method

The diverging Coulomb potential causes rapid oscillations in electronic wave functions near the nuclei, and special care is required to be able to work with smooth wave functions. The projector augmented-wave (PAW) method by Blöchl [69] is a widely adopted generalization of pseudopotential methods, utilizing their strength of smooth pseudo wave functions while retaining a mapping from all-electron wave functions ( $|\psi_{n}\rangle$ ) to pseudo wave functions ( $|\tilde{\psi}_{n}\rangle$ ).

The crux is to define a linear transformation $\hat{T}$ from pseudo to all-electron space,

\hat{T}|\tilde{\psi}\rangle=|\psi\rangle,

(1)

where

\hat{T}=1+\sum_{a}\sum_{i}\left(|\phi^{a}_{i}\rangle-|\tilde{\phi}^{a}_{i}% \rangle\right)\langle\tilde{p}^{a}_{i}|.

(2)

Here $\tilde{p}^{a}_{i}(\mathbf{r})$ , $\phi^{a}_{i}(\mathbf{r})$ and $\tilde{\phi}^{a}_{i}(\mathbf{r})$ are called projectors, partial waves and pseudo-partial waves, respectively. The pseudo-partial waves and projectors are chosen to be biorthogonal, $\int{\rm d}{\mathbf{r}}\,\tilde{\phi}^{a}_{i}(\mathbf{r})\tilde{p}^{a}_{j}(% \mathbf{r})=\delta_{ij}$ , allowing for an approximate closure relation

\sum_{i}\tilde{\phi}^{a}_{i}(\mathbf{r})\tilde{p}^{a}_{i}(\mathbf{r}^{\prime})% \approx\delta(\mathbf{r}-\mathbf{r}^{\prime}),

(3)

which is utilized heavily to obtain an efficient, but all-electron, picture. In addition to biorthogonality, the pseudo and all-electron partial waves are chosen to be equal outside the PAW augmentation sphere cutoff radius $\tilde{\phi}_{i}(\mathbf{r})=\phi_{i}(\mathbf{r})$ , $r>r_{c}.$

The basic recipe for converting an operator $\hat{O}$ is $\langle\psi_{n}|\hat{O}\psi_{n}\rangle=\langle\tilde{\psi}_{n}|\underbrace{% \hat{T}^{\dagger}\hat{O}\hat{T}}_{\hat{\tilde{O}}}\tilde{\psi}_{n}\rangle$ . For example, the all-electron Kohn–Sham equations

\hat{H}[n({\mathbf{r}})]|\psi_{n}\rangle=\epsilon_{n}|\psi_{n}\rangle,

(4)

where $\hat{H}$ is the single-particle all-electron Kohn–Sham Hamiltonian operator, can be transformed to their PAW counterparts:

\hat{T}^{\dagger}\hat{H}[n({\mathbf{r}})]\hat{T}|\tilde{\psi}_{n}\rangle=% \epsilon_{n}\hat{T}^{\dagger}\hat{T}|\tilde{\psi}_{n}\rangle.

(5)

We have used Eq. 1, and also multiplied with $\hat{T}^{\dagger}$ from left to make its dual space the pseudo one, i.e. $\langle\tilde{\psi}|\in\mathcal{H}^{*}$ can act from left. This results in a PAW Hamiltonian and in PAW overlap operators as follows:

$\displaystyle\hat{\tilde{H}}$	$\displaystyle=\hat{T}^{\dagger}\hat{H}[n({\mathbf{r}})]\hat{T}$
	$\displaystyle=-\frac{1}{2}\nabla^{2}+\tilde{v}_{\rm KS}(\mathbf{r})+\sum_{a}% \sum_{ii^{\prime}}\|\tilde{p}^{a}_{i}\rangle\Delta H^{a}_{ii^{\prime}}\langle% \tilde{p}^{a}_{i^{\prime}}\|,$	(6)
$\displaystyle\hat{\tilde{S}}$	$\displaystyle=\hat{T}^{\dagger}\hat{T}=1+\sum_{a}\sum_{ii^{\prime}}\|\tilde{p}^% {a}_{i}\rangle\Delta S^{a}_{ii^{\prime}}\langle\tilde{p}^{a}_{i^{\prime}}\|.$	(7)

Terms such as $\Delta H^{a}_{ii^{\prime}}$ and $\Delta S^{a}_{ii^{\prime}}$ represent so-called PAW-corrections. In each part of the description, which handles a particular kind of operators, such as kinetic energy, spin-operators, or the electrostatic potential, respective PAW-corrections must be calculated. The most crucial ones are precalculated, such as overlap, kinetic energy, Coulomb, and stored in the ‘setup’-file, which also stores the partial waves and projectors. As an example, the overlap PAW corrections are precalculated to setup as follows:

\Delta S^{a}_{ii}=\int{\rm d}{\mathbf{r}}\,\phi_{i}^{a}(\mathbf{r})\phi_{i^{% \prime}}^{a}(\mathbf{r})-\int{\rm d}{\mathbf{r}}\tilde{\phi}_{i}^{a}(\mathbf{r% })\tilde{\phi}_{i^{\prime}}^{a}(\mathbf{r}).

(8)

We further define the atomic density matrices as

D^{a}_{\sigma ii^{\prime}}=\sum_{n}f_{n}\langle\tilde{\psi}_{\sigma n}|\tilde{% p}^{a}_{i}\rangle\langle\tilde{p}^{a}_{i^{\prime}}|\tilde{\psi}_{\sigma n}\rangle.

(9)

The atomic density matrix contains all information required to construct PAW corrections to any local all-electron expectation value:

\langle O\rangle=\sum_{n}\langle\tilde{\psi}_{n}|\tilde{O}|\tilde{\psi}_{n}% \rangle+\sum_{a}\sum_{ii^{\prime}}D^{a}_{ii^{\prime}}\Delta O^{a}_{i^{\prime}i}.

(10)

The all-electron atomic density can be constructed as

n^{a}_{\sigma}(\mathbf{r})=\sum_{ii^{\prime}}D^{a}_{\sigma ii^{\prime}}\phi^{a% *}_{i}(\mathbf{r})\phi^{a}_{i^{\prime}}(\mathbf{r})+n_{\text{core}}^{a},

(11)

and the corresponding equation for pseudo-densities holds with $n\rightarrow\tilde{n}$ and $\phi\rightarrow\tilde{\phi}$ .

Since the exchange–correlation (xc) potential is non-linear, the PAW corrections must be evaluated explicitly. The xc PAW corrections are performed by constructing the atomic all-electron and pseudo-electron densities as given by Eq. 11:

	$\displaystyle\Delta H^{a}_{{\rm xc},ii^{\prime}}$	$\displaystyle=$	$\displaystyle\int{\rm d}{\mathbf{r}}\phi_{i}^{a}(\mathbf{r})v_{\rm xc}[n^{a}](% \mathbf{r})\phi_{i^{\prime}}^{a}(\mathbf{r})$		(12)
			$\displaystyle-\int{\rm d}{\mathbf{r}}\tilde{\phi}_{i}^{a}(\mathbf{r})v_{\rm xc% }[\tilde{n}^{a}](\mathbf{r})\tilde{\phi}_{i^{\prime}}^{a}(\mathbf{r}).$		(12)

This integral is numerically evaluated in a Cartesian product of a Lebedev angular grid and a non-uniform radial grid (denser mesh closer to the nucleus) for each atom.

III.2 Numerical implementation

III.2.1 Wave-function representations

GPAW supports three representations for smooth wave functions. The plane wave (PW)

\tilde{\psi}_{{\mathbf{k}}n}(r)=\sum_{\mathbf{G}}C_{n{\mathbf{G}}}e^{i({% \mathbf{k}+\mathbf{G}})\cdot\mathbf{r}}

(13)

and linear combination of atomic orbitals (LCAO)

\tilde{\psi}_{{\mathbf{k}}n}(r)=\sum_{\mu}C_{n{\mathbf{k}}\mu}\Phi_{\mathbf{k}% \mu}(\mathbf{r})

(14)

representations rely on basis functions. The finite difference (FD) mode relies on a representation of kinetic energy operator on a uniform Cartesian grid.

III.2.2 All-electron quantities

The beauty of the PAW method is that you never need to transform the pseudo wave functions to all-electron wave functions, but you can do it if you want to. GPAW has tools for interpolating the pseudo wave functions to a fine real-space grid and adding the PAW corrections. A fine real-space grid is needed to properly represent the cusp and all the oscillations necessary for the wave function to be orthogonal to all the frozen core states.

GPAW also has tools for calculating the all-electron electrostatic potential. This is useful for transmission electron microscopy (TEM) simulations [70]. Most TEM simulations have relied on the so-called independent atom model (IAM), where the specimen potential is described as a superposition of isolated atomic potentials. While this is often sufficient, there is increasing interest in understanding the influence of valence bonding [71]. This can be investigated by a TEM simulation code such as abTEM [72], which can directly use ab initio scattering potentials from GPAW.

III.2.3 Solving the Kohn–Sham equation

The default method for solving the Kohn–Sham equation for PW and FD modes is to do iterative diagonalization combined with density mixing; for LCAO mode we do a full diagonalization of the Hamiltonian. Alternatively, one can do direct minimization as described in the next section.

For PW and FD modes, we need an initial guess of the wave functions. For this, we calculate the effective potential from a superposition of atomic densities and diagnonalize an LCAO Hamiltonian in a small basis set consisting of all the pseudo partial waves corresponding to bound atomic states.

Each step in the self-consistent field (SCF) loop consists of the following operations: 1) Diagonalization of the Hamiltonian in the subspace of the current wave functions (skipped for LCAO). 2) One or more steps through the iterative eigensolver (except for LCAO, where a full diagonalization is performed). 3) Update of eigenvalues and occupation numbers. 4) Density mixing and symmetrization. See previous work [32, 33] and Ref. [73] for details.

GPAW has two kinds of Poisson equation solvers: direct solvers based on Fourier transforms or Fourier-sine transforms, and iterative multi-grid solvers (Jacobi or Gauss-Seidel). The default is to use a direct solver whereas the iterative solvers may be chosen for larger systems where they can be more efficient.

For 0-, 1- and 2-dimensional systems, the default boundary conditions are to have the potential go to zero on the cell boundaries. This becomes a problem for systems involving large dipole moments. The potential due to the dipole is long-ranged and, thus, the converged potential requires large vacuum sizes. For molecules (0D systems), the boundary conditions can be improved by adding multipole moment corrections to the density so that the corresponding multipoles of the density vanish. The potential of these corrections is added to the obtained potential. The same trick is used to handle charged systems. For slabs (2D systems), a dipole layer can be added to account for differences in the work functions on the two sides of the slab.

Methods for calculating occupation numbers are the Fermi-Dirac, Marzari-Vanderbilt [74] and Methfessel-Paxton distributions as well as the tetrahedron method and the improved tetrahedron method [75].

III.2.4 Updating wavefunctions in dynamics

Simulations commonly move the atoms without changing other parameters. If an atom moves only slightly, we would expect most of the charge in its immediate vicinity to move along with it. We use this to compute an improved guess for the wave functions in the next self-consistency loop with FD or PW mode where the eigensolver is iterative.

Near the atoms, the dual basis of pseudo-partial waves and projectors is almost complete, i.e.

\sum_{i}|\phi_{i}^{a}\rangle\langle\tilde{p}_{i}^{a}|=1\quad\textrm{(near atom% {\hbox{a}})}.

(15)

If an atom moves by $\Delta\mathbf{R}^{a}$ , the wave functions $\tilde{\psi}_{n}(\mathbf{r})$ are updated by rigidly moving the projection $\sum_{i}\tilde{\phi}_{i}^{a}(\mathbf{r})\langle\tilde{p}_{i}^{a}|\tilde{\psi}_% {n}\rangle$ along with it, i.e.,

	$\displaystyle\tilde{\psi}_{n}^{\mathrm{new}}(\mathbf{r})=\tilde{\psi}_{n}(% \mathbf{r})$	$\displaystyle+$	$\displaystyle\sum_{ai}\tilde{\phi}_{i}^{a}(\mathbf{r}-\Delta\mathbf{R}^{a})% \langle\tilde{p}_{i}^{a}\|\tilde{\psi}_{n}\rangle$		(16)
		$\displaystyle-$	$\displaystyle\sum_{ai}\tilde{\phi}_{i}^{a}(\mathbf{r})\langle\tilde{p}_{i}^{a}% \|\tilde{\psi}_{n}\rangle.$		(16)

As the partial waves on different atoms are not orthonormal, this expression generally “double-counts” contributions, resulting in wave functions that are to some extent unphysical. Nevertheless, we have found that this simple method achieves a significant speedup ( $\sim$ 15% in realistic structure optimisations) compared to not updating the wave functions.

The method could be further improved by using the LCAO basis set and the overlap matrix to prevent double-counting.

III.2.5 Direct minimization

Direct orbital minimization [76, 77, 78, 79] is a robust alternative to the conventional eigensolver and density mixing routines. The orbitals can be expressed as a unitary transformation of a set of reference, or auxiliary, orbitals $\mathbf{\Psi}_{0}$ :

\mathbf{\Psi}=\mathbf{U}\mathbf{\Psi}_{0}

(17)

In the direct minimization (DM) method implemented in GPAW [80, 50], the unitary matrix $\mathbf{U}$ is parametrized as an exponential transformation, i.e. $\mathbf{U}=e^{\mathbf{A}}$ , where ${\mathbf{A}}$ is an anti-Hermitian matrix ( ${\mathbf{A}}=-{\mathbf{A}^{\dagger}}$ ). The energy can be considered as a functional of both $\mathbf{A}$ and $\mathbf{\Psi}_{0}$ :

E[\mathbf{\Psi}]=\mathcal{F}[\mathbf{A},\mathbf{\Psi}_{0}]

(18)

Thus, in general, the optimal orbitals corresponding to the minimum of the energy functional can be found in a dual loop procedure. First, the energy is minimized with respect to the elements of $\mathbf{A}$ , and second, the functional $\mathcal{L}[\mathbf{\Psi}_{0}]=\underset{\mathbf{A}}{\rm min\,}\mathcal{F}[% \mathbf{A},\mathbf{\Psi}_{0}]$ is minimized with respect to $\mathbf{\Psi}_{0}$ :

\underset{\mathbf{\Psi}}{\rm min\,}E[\mathbf{\Psi}]=\underset{\mathbf{\Psi}_{0% }}{\rm min\,}\underset{\mathbf{A}}{\rm min\,}\mathcal{F}[{\mathbf{A}},\mathbf{% \Psi}_{0}]

(19)

Since anti-Hermitian matrices form a linear space, the inner loop minimization can use well-established local minimization strategies such as efficient quasi-Newton methods with inexact line search, e.g. the limited-memory Broyden–Fletcher–Goldfarb–Shanno (L-BFGS) algorithm. The outer loop minimization follows the gradient $\partial\mathcal{L}/\partial\mathbf{\Psi}_{0}$ projected on the tangent space at $\mathbf{\Psi}_{0}$ .

The GPAW formulation of DM is applicable with all representations of the orbitals available in GPAW, as well as Kohn–Sham (unitary invariant) and orbital density dependent (nonunitary invariant) energy functionals, and can be used for both finite and extended systems. In LCAO calculations, the reference orbitals are expressed as a linear combination of the atomic basis functions $\mathbf{\Phi}$ , $\mathbf{\Psi}_{0}=\mathbf{\Phi}\mathbf{C}_{0}$ , where the matrix of coefficients $\mathbf{C}_{0}$ is fixed. Therefore, only a minimization with respect to the elements of the matrix $\mathbf{A}$ is required [80]. For plane wave and real-space grid representations, a minimization in the space tangent to the reference orbitals is sufficient if the functional is unitary invariant. Otherwise, if the functional is nonunitary invariant, such as when self-interaction correction is used (see section III.3.5), an inner loop minimization in the occupied–occupied block of the matrix $\mathbf{A}$ is performed to make the energy stationary with respect to unitary transfomation of the occupied orbitals [50].

The DM method avoids diagonalization of the Hamiltonian at each step and as a result it usually involves a smaller computational effort. The DM method has also been shown to be more robust than conventional eigensolvers and density mixing in calculations of molecules and extended systems [80]. However, the current implementation does not support finite-temperature distribution of occupation numbers and thus can only be used for systems with a finite band gap.

III.2.6 Convergence criteria

The modular architecture of GPAW allows the user to have precise control over how the SCF loop decides that the electronic structure has converged to sufficient precision. GPAW contains simple keywords for common convergence criteria, such as “energy”, “forces”, (electron) “density”, and “work function”, which are sufficient for the most common use cases.

Internally, all convergence criteria are instances of convergence classes, and for each step through the SCF loop, each convergence class is called. When any convergence class is called, it is passed a context that contains the current state of the calculation, such as the wavefunctions and the Hamiltonian. The criterion can thus pull relevant data from the calculation to decide if it is converged. Because the convergence criterion itself is an object, it can store information, such as previous values of the energy for comparison to the new value. When all convergence criteria report that they are converged, the calculation as a whole is considered to be converged and terminates.

This modular nature gives the user full control over how each convergence criterion operates. For example, the user can easily ask the energy criterion to check the differences in the last four values of the energy, rather than the last three. If a convergence criterion itself is expensive to calculate, it can make sense to not check it until the rest of the convergence criteria are met. This can be accomplished by activating an internal “calculate_last” flag within the convergence criterion.

Users can easily add their own custom convergence criteria to the SCF loop. If a user would like to use a criterion not included by default with GPAW, it is straightforward to write a new criterion as a Python class, and pass this to the convergence dictionary of GPAW. For example, if one wanted to be sure the bandgap of a semiconductor was converged, the criterion could check the bandgap at each iteration and compare it to stored values from previous iterations, and report that the calculation is converged when the peak-to-peak variance among the last $n$ iterations is below a given threshold.

III.2.7 PAW data sets and pseudopotentials

GPAW can read PAW datasets from (possibly compressed) XML-files following the PAW-XML specification [81]. Dataset files for most of the periodic table can be downloaded from the GPAW webpage or installed with the gpaw install-data command-line tool. The datasets are available for the LDA, PBE, revPBE, RPBE and GLLBSC functionals. The electronic structure code Abinit [82] also reads the PAW-XML format allowing GPAW and Abinit to share PAW dataset collections such as the Jollet-Torrent-Holzwarth collection [83].

Specialized datasets can be generated with the gpaw dataset command-line tool. This allows one to tweak the properties of a dataset. Some examples could be: 1) add more/less semi-core states; 2) increase/decrease the augmentation sphere radius to make the pseudo wave functions more/less smooth; 3) add/remove projector functions and corresponding pseudo and all-electron partial waves; or 4) base the PAW dataset on a different XC-functional. These changes will affect the accuracy and cost of the calculations.

GPAW is also able to use norm-conserving pseudopotentials (NCPP) such as HGH [84] and pseudopotentials in the UPF format, such as SG15 [85]. Non-local NCPPs can be considered an approximation to PAW: in the PAW description, the non-local part of the Hamiltonian (the term containing $\Delta H_{ij}^{a}$ in Eq. (6)) will adapt to the environment, whereas for a NCPP, $\Delta H_{ij}^{a}$ will be diagonal and have a fixed value taken from a reference atom. Because of the norm-conservation, NCPPs will have $\Delta S_{ij}^{a}=0$ .

III.2.8 Parallelization

GPAW can parallelize over various degrees of freedoms depending on the type of calculation, and implements multiple algorithms for achieving good parallel performance and scalability. In calculations involving k-points, parallelization over them is typically the most efficient, as little communication is needed during the summation of wave functions to calculate the density and any derived quantities. As the number of k-points is often limited especially in large systems, parallelization is also possible over real-space grids in the FD and LCAO modes, as well as over plane waves in the PW mode. All modes also support parallelization over electronic bands, which is particularly efficient for real-time TDDFT where the time-propagation of each band can be carried out independently. Additional parallelization possibilities exist depending on calculations, such as over electron–hole pairs in linear-respose TDDFT calculations.

Parallelization is done mainly with MPI. In the FD and LCAO modes, it is possible in addition to use also OpenMP within shared-memory nodes, which can improve performance when the number of CPU cores per node is large. Dense linear algebra, such as matrix diagonalization and Cholesky decomposition, can be carried out with the parallel ScaLAPACK or ELPA libraries. This applies to both the direct diagonalization in the LCAO mode as well as to subspace diagonalizations in iterative Davidson and RMM-DIIS methods in the FD and PW modes.

For ground-state calculations, GPAW will divide the cores into three MPI-communicators: k-points, bands and domain. When parallelizing over k-points and/or bands, all the cores of the k-point and/or band communicators will have a copy of the density (possibly distributed over the domain communicator). GPAW has the option to redistribute the density from the domain-communicator to all cores so that operations such as evaluating the XC-energy and solving the Poisson equation can be done more efficiently.

For each k-point, all the cores in the band and domain communicators will cooperate on calculating matrix elements of the Hamiltonian and the overlap operators. Dense-matrix linear-algebra operations on those matrices can be done on a single core (most efficient for small systems) or with ScaLAPACK where the matrices are distributed over either all or some of the cores from the pool of cores in the band and domain communicators.

One drawback of Python is that in large parallel calculations its import mechanism may incur a heavy load on the filesystem because all the parallel processes are trying to read the same Python files. GPAW tries to alleviate this by a special “broadcast imports” mechanism: during initial module imports only a single process loads the modules, afterwards MPI is used to broadcast the data to all the processes.

Parallel scalability depends strongly on the calculation mode and the system. The FD mode offers the best scalability for high core counts as only nearest-neighbour communication is needed across domains over a domain decomposition. In PW mode, the limiting factor is all-to-all communication in parallelization over plane waves. In LCAO mode, communications arise from multi-center integrals of basis functions across domains. At best, GPAW scales to tens or hundreds of nodes in supercomputers.

III.2.9 GPU implementation

The GPU implementation of GPAW works both on NVIDIA and AMD GPUs, targeting either CUDA or HIP backends, respectively. GPAW uses a combination of manually written GPU kernels, external GPU libraries (such as cuBLAS / hipBLAS), and CuPy [86]. CuPy offers an easy-to-use Python interface for GPUs centered around a NumPy-like GPU array and makes many hardware details completely transparent to the end-user.

In the manually written GPU kernels, both GPU backends (CUDA and HIP) are targeted using a header-only porting approach [87] in which generic GPU identifiers are translated to vendor-specific identifiers at compile time. For example, to allocate GPU memory, the identifier gpuMalloc is used in the code, that is then translated either to cudaMalloc or hipMalloc depending on which GPU backend is targeted. This allows us to avoid unnecessary code duplication and still target multiple hardware platforms natively.

An earlier GPU implementation of GPAW [88, 89] served as the starting point for the recent work on a new GPU code based on the rewritten ground-state code. The objects that store quantities like $\tilde{\psi}_{n}(\mathbf{r})$ , $\tilde{p}_{i}^{a}(\mathbf{r})$ , $\langle\tilde{p}_{i}^{a}|\tilde{\psi}_{n}\rangle$ , $D_{ii^{\prime}}^{a}$ , $\tilde{n}(\mathbf{r})$ and $\tilde{v}(\mathbf{r})$ , use Numpy arrays for the CPU code and CuPy arrays when running on a GPU. At the moment, GPUs can be used for total-energy calculations with LDA/GGA in the PW mode.

Parallelization to multiple GPUs is done using MPI. Each MPI rank is assigned a single GPU and communication between the GPUs is handled by MPI. Support for GPU-aware MPI makes it possible to do direct GPU-to-GPU communication without unnecessary memory copies between the GPU device and the host CPU.

III.3 XC-functionals

Exchange–correlation (XC) functionals provide a mapping between the interacting and the non-interacting system of electrons. In Kohn–Sham DFT, the density is built from a set of occupied non-interacting single-particle orbitals $\psi_{i}$ : $n=\sum_{i}f_{i}\left|\psi_{i}\right|^{2}$ , with $f_{i}$ denoting the occupation numbers, leading to the same density as the interacting system. The total energy in DFT is expressed as a sum of density functionals for the different contributions:

\displaystyle E_{\mathrm{tot}}[n]=T_{\mathrm{S}}[n]+V_{\mathrm{ext}}[n]+U_{% \mathrm{H}}[n]+E_{\mathrm{xc}}[n],

(20)

where $T_{\mathrm{S}}[n]$ denotes the kinetic energy of the non-interacting system, $V_{\mathrm{ext}}[n]$ the energy of the density in the external potential, $U_{\mathrm{H}}[n]$ the classical Coulomb energy of the density with itself, and $E_{\mathrm{xc}}[n]$ the so called exchange–correlation energy, which collects all energy contributions missing in the prior terms and therefore provides a mapping between the interacting and the non-interacting system of electrons. While the first three terms can be calculated exactly, even the form of $E_{\text{xc}}$ is unknown and although proven to exist and be exact in principle, it has to be approximated in practice. A huge number of approaches belonging to several families exists [1, 2, 90]. Several of these approximations are available in GPAW, and an overview is given in the following.

III.3.1 Libxc and Libvdwxc

The libxc library [65] provides implementations of several (semi-)local variants of the XC functional, given by the LDA, GGA, and MGGA families. These are available in GPAW by a combination of their names from libxc as, e.g. “GGA_X_PBE+GGA_C_PBE” for PBE [3]. Additionally, GPAW provides its own implementation of several (semi-)local functionals called by their short-names, e.g. TPSS [91], PBE [3] and LDA, the latter with the correlation of Perdew and Wang [92]. Several hybrids, see below, are implemented in GPAW with the support of the libxc library for their local parts.

For fully non-local van der Waals functionals, like the vdW-DF functional [31], GPAW uses the efficient fast Fourier-transform convolution algorithm by Roman-Perez and Soler [93] as implemented in the libvdwxc library [66].

III.3.2 GLLB-sc

GPAW has an implementation of the solid-state modification of the Gritsenko–van Leeuwen–van Lenthe–Baerends exchange–correlation model potential (GLLB-sc) [94, 95]. This potential has been shown to improve the description of the band gap [16] and the $d$ -electron states in noble metals [52, 96, 97, 98].

III.3.3 Hubbard U

DFT+U calculations using a Hubbard-like correction can be performed in GPAW to improve the treatment of Coulombic interactions of localized electrons. This correction is most commonly applied to the valence orbitals of transition metals to assist with obtaining experimental band gaps of oxides [99], which may otherwise be underestimated [100, 101]. Also, formation energies and magnetic states are often improved due to a more correct description of the electronic structure. It may also be applied to main-group elements such as N, O, P, or S, but this is less commonly done [102].

In the formalism chosen in GPAW [103], one uses a single $U^{\text{eff}}$ parameter rather than seperate $U$ and $J$ parameters for on-site Coulombic and on-site exchange, respectively [104]. The correction influences the calculation by applying an energy penalty to the system:

U^{\text{eff}}=U-J,

(21)

E_{\text{DFT+U}}=E_{\text{DFT}}+\sum_{a,i}\frac{U^{\text{eff}}_{i}}{2}\textup{% Tr}\left(\rho^{a}_{i}-\rho^{a}_{i}\rho^{a}_{i}\right),

(22)

where the sum runs over the atoms $a$ and orbitals $i$ for which the correction should be applied. $E_{\text{DFT}}$ is calculated by a standard GPAW calculation and is corrected to $E_{\text{DFT+U}}$ by penalizing the energy such that fully occupied or fully unoccupied orbitals are stabilized. The magnitude of the correction depends on $U^{\text{eff}}_{i}$ , and the atomic orbital occupation matrix ( $\rho^{a}_{i}$ ) controls which orbitals contribute to the correction based on their occupation. In principle, any orbital on any element that is partially occupied can be corrected.

GPAW supports normalization of the occupation matrix, accounting for the truncated portion of the wave function outside the augmentation sphere. To maintain consistency with other codes that do not support normalization, this normalization can be disabled, but large disagreements are expected when applied to $p$ orbitals. GPAW is one of the few codes that currently supports multiple simultaneous corrections to orbitals on the same atom; this is useful when two types of orbitals, such as $p$ and $d$ orbitals, are nearly degenerate but both are partially occupied.

There is no $U^{\text{eff}}$ that is strictly correct, but methods such as RPA [105] or linear response [106, 107] can allow for a value to be calculated from first principles. More commonly, $U^{\text{eff}}$ is chosen semi-empirically to fit experimental properties such as formation energy [108], band gap [109], or more recently by machine learning predictions [110].

III.3.4 Hybrids

Hybrid functionals, especially range-separated functionals (see below), correct problems present in calculations utilizing (semi-)local functionals such as the wrong asymptotic behavior of the effective potential leading to an improper description of Rydberg excitations [111], the improper calculation of the total energy against fractional charges leading to the charge delocalization error [112], and the wrong description of (long-range) charge transfer excitations due to the locality of the exchange hole [113, 114, 115].

The exchange–correlation energy $E_{\mathrm{xc}}$ , can be split into the contributions from exchange, $E_{\mathrm{x}}$ , and from correlation, $E_{\mathrm{c}}$ [2, 116]. Hybrid functionals combine the exchange from the (semi-)local functionals with the exchange from Hartree–Fock theory (HF). Global hybrids such as PBE0 [117] mix the exchange from DFT with the exchange from HF by a global fixed amount. Range-separated functionals (RSF) add a separation function $\omega_{\mathrm{RSF}}$ [118, 119] to express the Coulomb kernel in the exchange integrals as

	$\displaystyle\frac{1}{r_{12}}$	$\displaystyle=\underbrace{\frac{1-\left[\alpha+\beta\left(1-\omega_{\mathrm{% RSF}}\left(\gamma,r_{12}\right)\right)\right]}{r_{12}}}_{\mathrm{SR}}$
		$\displaystyle+\underbrace{\frac{\left[\alpha+\beta\left(1-\omega_{\mathrm{RSF}% }\left(\gamma,r_{12}\right)\right)\right]}{r_{12}}}_{\mathrm{LR}},$		(23)

where $r_{12}=|\vec{r}_{1}-\vec{r}_{2}|$ . Here $\alpha$ and $\beta$ are mixing parameters for global and range-separated mixing, respectively. $\omega_{\mathrm{RSF}}$ is a soft function with values ranging from one for $r_{12}=0$ to zero for $r_{12}\to\infty$ , where the decay is controlled by the parameter $\gamma$ . Long-range-corrected RSFs such as LCY-PBE [120] use (semi-)local approximations for short-range (SR) interaction and apply HF exchange for long-range (LR) interaction. Short-range-corrected RSFs such as HSE [5] reverse this approach. The parameter $\gamma$ is either fixed or can be varied to match criteria of the ideal functional, e.g. that the energy of the highest occupied molecular orbital matches the ionization potential [121, 116].

Details on the FD-mode implementation of long-range corrected RSF can be found in Refs. [122, 123]. In general, the FD-mode implementation of hybrids is limited to molecules, and forces have not been implemented. The PW-mode implementation of hybrids handles $\mathbf{k}$ -points, exploits symmetries and can calculate forces.

III.3.5 SIC

A fully self-consistent and variational implementation of the Perdew-Zunger self-interaction correction [124] (PZ-SIC) is available in GPAW. It corrects the various problems with (semi-)local Kohn–Sham functionals mentioned above in the context of hybrid functionals. Atomic forces are available in the GPAW implementation with all three types of basis sets. A corrected KS functional has the form

E^{\mathrm{SIC}}[n_{1},\dots,n_{n}]=E^{\mathrm{KS}}[n]-\alpha\sum_{i}^{\mathrm% {occ}}\left(U_{\mathrm{H}}[n_{i}]+E_{\mathrm{xc}}[n_{i},0]\right)

(24)

where the self-Coulomb and self-XC of each occupied orbital density is subtracted from the Kohn–Sham energy functional. Due to the explicit dependence on the orbital densities, the corrected energy functional is not invariant under a unitary transformation among occupied orbitals and thereby not a KS functional. As a result, the minimization of $E^{\mathrm{SIC}}$ requires special direct minimization techniques (see section III.2.5) and delivers a specific set of (typically localized) optimal orbitals. The calculations should be carried out using complex orbitals [125, 126, 127]. The full PZ-SIC has been shown to give an over-correction to binding energy as well as band gaps and improved results for these properties are obtained by scaling the SIC by $\alpha=1/2$ [128, 129], while the long-range form of the effective potential, necessary for Rydberg state calculations, requires the full correction [130].

PZ-SIC has been shown to give accurate results in cases where commonly used KS functionals fail. This includes, for example, the Mn dimer where the PBE functional gives qualitatively incorrect results while the corrected functional gives close agreement with high-level quantum chemistry results as well as experimental measurements [131]. Another example is the defect state of a substitutional Al atom in $\alpha$ -quartz [132]. Also, PZ-SIC has been shown to improve values of the excitation energy of molecules obtained in variational calculations of excited states [50, 133] (see section VIII.2).

III.3.6 BEEF

A great strength of Kohn–Sham DFT and its extensions is that a reasonably high accuracy of physical, material, and chemical properties can be obtained at a relatively moderate computational cost. DFT is thus often used to simulate materials, reactions, and properties, where there de facto exists no “better” alternative. Even though more accurate electronic structure methods in principle might exist for a given application, the poor scaling of systematically more accurate methods often makes these computationally infeasible at the given system size that is being studied using DFT. One is therefore often in the situation that the accuracy of a given DFT calculation of some materials or chemical property cannot be verified against e.g. a more accurate solution of the Schrödinger equation, even on the biggest available supercomputers.

On the other hand, the wealth of available XC functionals naturally allows one to look at how sensitive a DFT result is on the choice of functional, and often accuracy is therefore being judged primarily by applying a small set of different XC functions, especially if no accurate benchmark theoretical simulation or experimental measurement is available. A challenge, however, is that the different available functionals are often known to be particularly good at simulating certain properties and poor at others. It is thus not at all clear how much one should trust a given functional for a given simulated property. The Bayesian Error Estimation (BEE) class of functionals [134] attempts to develop a practical framework to establishing an error estimate from a selected set of functional “ingredients”.

Assume that the XC model $M(a)$ is a function of a set of parameters, $a$ , that can be varied freely. If a benchmark data set, $D$ , of highly accurate properties established from experiments or higher-accuracy electronic structure simulations is available, we can attempt to identify the ensemble of models given by some distribution function, $P(M(a))$ , such that the most likely model in the ensemble, $M(a_{0})$ , makes an accurate predictions for the benchmark data set, while the spread of ensemble reproduces the spread between the predictions of the most likely model and the benchmark data.

Bayes’ theorem provides a natural framework to search for the ensemble distribution. If a joint distribution between $M(a)$ and $D$ exists, which we would assume, then Bayes’ theorm gives

P(M(a)|D)\propto P(D|M(a))P_{0}(M(a)),

(25)

where $P_{0}(M(a))$ is the prior expectation to the distribution of models before looking at the data, $D$ , and $P(D|M(a))$ is a likelihood of seeing the data given the model. To achieve a useful ensemble, much care has to be put into finding a large enough set of varied and accurate data for different materials and chemical properties, and much care has to be applied as to how the ensemble is regularized to avoid overfitting [135, 136].

In all the BEE functionals, choices are made such that ultimately the XC functional is linear in $a$ , and such that the distribution of $a$ ends up following a multidimensional Normal distribution given by a regularized covariance matrix $\Gamma$ :

P(a)\propto\exp(-(a-a_{0})\Gamma(a-a_{0})),

(26)

where $\Gamma$ has been scaled in such a way that the ensemble reproduce the observed standard deviation between $M(a_{0})$ and $D$ .

Figure 2 shows an ensemble of exchange enhancement factors $F_{x}(s)$ from the BEEF-vdW functional, where $s=|\nabla n|/(2k_{F}n)$ is the reduced density gradient. The approach has given rise to several functionals BEEF-vdW [135], mBEEF [136], and mBEEF-vdW [137], which all include error estimation and are readily available in GPAW. Through the ASE interface, one can for example utilize the error ensembles to establish error estimates on Python-implemented models that are using DFT simulations for their parametrization. An example of this is the application to error estimates on adsorption scaling relations, microkinetics, and materials selection [138].

One risk to the approach of establishing error estimates from a small selected “ensemble” of XC-functionals is clearly that if the simulated property in question is poorly described by all functionals in the ensemble, then the error estimate might be also become poor. This could for example be the case if one tried to establish an error estimate for band gaps in oxides or van der Waals bonding of adsorbates on surfaces based on an ensemble of GGA XC-functionals, since no GGA functional may be accurate for simulating either property.

IV Ion dynamics

GPAW can be employed as a ‘black-box’ calculator, supplying energies and forces to other programs such as e.g. ASE, which then perform optimization of ground state geometries and reaction paths or carry out molecular dynamics. In fact, this is a key design principle behind GPAW. Methodological developments and general implementations that are not directly dependent on fast access to detailed electronic structure information should preferably be implemented externally to GPAW. This lead to maximal simplicity of the GPAW code itself, while also allowing for the external code to be utilized and tested with other electronic structure codes and atomistic potentials. A key to the high efficiency of GPAW simulations involving ionic displacements is the versatile implementation of constraints in ASE. Here many types of constraints are readily accessible, from the simple removal of degrees of freedom to more exotic constraints allowing for rigid molecule dynamics [139], to harmonic restoring forces [140], spacegroup preservation and combined ionic–unit cell dynamics [141]. Many algorithms are available for various structure optimization and molecular dynamics tasks.

IV.1 Structure relaxation

Local structure optimization in GPAW is typically achieved through the use of an optimizer from ASE. Here a larger range of standard optimizers are available such as quasi-Newton algorithms including BFGS [142, 143, 144, 145] and limited-memory BFGS [146], or Newtonian dynamics-based algorithms such as MDMin and FIRE [147]. That the optimizers have been implemented externally to GPAW provides benefits in terms of simple procedures for restarting long simulations and monitoring their convergence. Some optimizers from SciPy [68] are also available through the open-source ASE package, which provides a simple way to interface any optimizer in the SciPy format to GPAW. Preconditioning is implemented in an accessible way [148], which often leads to significant performance improvements.

Of the classical optimization methods, the quasi–Newton algorithms are often highly competitive. Here one builds up information on the Hessian or inverse Hessian matrix from calculated forces, ultimately leading to an accurate harmonic model of the potential energy surface in the vicinity of the local minimum. Such algorithms can, however, have problems both dealing with anharmonicity in the potential energy surface and with any noise in the electronic structure simulations. It often makes sense instead to fit a Gaussian process to the calculated energies and forces, and minimize within this model potential. This is implemented as the so-called GPmin method in ASE and often converges on the order of three times faster than the best quasi–Newton optimizers. [149]

IV.2 Reaction paths and barriers

Reliable calculations of energy barriers are of key importance for determining the rate of atomistic processes. In many quantum-chemical codes utilizing accurate atom-centered basis functions, this is achieved using analytical second derivatives in the direct search for first-order saddle points. This approach is less useful in the plane-wave based or grid-based modes of GPAW. The Dimer method [150] is implemented in ASE and can be used with GPAW, but often one would like to have an overview of the entire reaction path of an atomic-scale process to verify that the correct energy barrier for a process has been determined, and to obtain a full overview of the atomistic mechanism. For this purpose, the Nudged Elastic Band (NEB) method is typically employed through the ASE interface to GPAW. Both the original method [151] and a range of later improvements are available [152, 153, 154, 155, 156, 157]. Special care has to be taken in selecting the optimizer for a NEB algorithm, as this choice can have drastic influence on the convergence rate. For optimization of the reaction path, drastic performance improvements can be obtained by carrying out the optimization in a surrogate machine-learning model fitted to the potential energy surface [158, 159, 160]. GPAW has been used to drive NEB calculations on both surface systems [161, 162, 163, 164] as well as in molecules [164, 165].

IV.3 Global structure optimization

GPAW is integrated with various tools for global optimization of structures, compositions, as well as materials morphologies. Some quite generally applicable global optimization tools available are Basin Hopping [166], Minima Hopping [167], and Genetic Algorithms [168]. Some of the most powerful global optimization problems addressed using GPAW rely on machine-learning accelerated global-optimization strategies. These strategies have for example been applied to surfaces, clusters [169], and crystal structures in general [170]. In other machine-learning accelerated global-optimization routines, GPAW was used to generate initial databases of surface and bulk systems, and for later model validation in a strategy that uses Gaussian processes to generate surrogate potential-energy surfaces. These were then explored with Bayesian-optimization techniques, achieving speed-ups over more conventional methods of several orders of magnitude in finding optimal structures of the systems under investigation [171, 172, 173]. The method has been augmented through introducing extra (hyper)dimensions by interpolating between chemical elements, which speeds up the efficiency of the global search [174]. GPAW has been integrated with a Covariance Matrix Adaptation Evolution Strategy (CMA-ES) framework, providing energies and forces, generating training data, and evaluating CMA-ES candidate structures [175].

IV.4 Molecular dynamics and QM/MM

Ab initio molecular dynamics (MD) can be performed and analyzed through the ASE interface to GPAW. This include standard simulations such as constant-NVE, constant-NVT, and constant-NPT simulations. There is access to e.g. Langevin-, Andersen-, Nose-Hoover-, and Berendsen-dynamics. Due to the externalization of the dynamics, the development of novel algorithms and analysis tools becomes facile. Some examples of the use of GPAW span ab initio MD studies that have explored the liquid structure of water [176] and the water/Au(111) electrochemical interface [177].

Furthermore, GPAW is capable of working with external electrostatic potential terms from user-supplied point charge values and positions, enabling Quantum Mechanical (QM) / Molecular Mechanical (MM) simulations. Since GPAW is from the outset designed around highly efficient grid operations, the computational overhead of evaluating this potential as well as the resulting forces on the MM point charges from the QM density is kept low [178]. GPAW has been central in a range studies of ion dynamics in solution, both in and out of equilibrium. By using the molecular-dynamics functionality of ASE, researchers have performed QM/MM MD simulations of excited-state bond formation in photocatalyst model systems [179, 180, 181, 182], and electron transfer as well as coupled solute-solvent conformational dynamics in photosensitizer systems [183, 165]. Work on polarizable embedding QM/MM within GPAW is ongoing [184, 185].

V Magnetism and spin

Many important technological applications utilize magnetic order or manipulation of spin in materials. GPAW has a wide range of functionalities that faciliate the analysis of magnetic properties. This includes calculations with non-collinear spin, inclusion of external magnetic fields, spin–orbit coupling and spin spiral calculations within the generalized Bloch theorem. The implementaton of these features is described below while additional methods for calculating magnetic excitations are described in section VI.4.

V.1 Spin–orbit coupling

The spin–orbit coupling is typically completely dominated by regions close to the nuclei where the electrostatic potential becomes strong. Within the PAW augmentation sphere of atom $a$ , the spin- $\sigma$ component of orbital $n$ is given by

\psi_{n\sigma}(\mathbf{r})=\sum_{i}\langle\tilde{p}^{a}_{i}|\tilde{\psi}_{n% \sigma}\rangle\phi_{i}^{a}(\mathbf{r}).

(27)

Assuming a spherically symmetric potential, the spin–orbit Hamiltonian for atom $a$ is then written as

\hat{H}_{\mathrm{SOC}}^{a}=\frac{\mu_{\mathrm{B}}}{\hbar m_{\mathrm{e}}ec^{2}}% \frac{1}{r}\frac{\mathrm{d}V_{r}^{a}}{\mathrm{d}r}\hat{\mathbf{L}}\cdot\hat{% \mathbf{S}},

(28)

where $V_{r}^{a}$ is the radial electrostatic potential of atom $a$ . We evaluate $V_{r}^{a}$ as the spherical part of the XC- and Hartree-potential from the local expansion of the density given by Eq. 11. Since the partial waves $\phi^{a}_{i}$ are eigenstates of the scalar-relativistic Hamiltonian, these are independent of spin and it is straightforward to evaluate the action of $\hat{\mathbf{L}}$ on them.

The eigenenergies can be accurately calculated in a non-selfconsistent treatment of the spin–orbit coupling and may be obtained by diagonalizing the full Hamiltonian in a basis of scalar-relativistic orbitals [186]:

H_{mn}=\varepsilon^{0}_{m}\delta_{mn}+\sum_{aij\sigma\sigma^{\prime}}\langle% \tilde{\psi}_{m\sigma}^{0}|\tilde{p}^{a}_{i}\rangle\langle\phi_{i}^{a}|\hat{H}% ^{a}_{\mathrm{SOC,\sigma\sigma^{\prime}}}|\phi_{j}^{a}\rangle\langle\tilde{p}^% {a}_{j}|\tilde{\psi}_{n\sigma^{\prime}}^{0}\rangle.

(29)

Here $\varepsilon^{0}_{m}$ and $\psi_{m}^{0}$ represent the scalar-relativistic eigenenergies and eigenstates, respectively. This constitutes a fast post-processing step for any scalar-relativistic calculation and only requires the projector overlaps $\langle\tilde{p}^{a}_{j}|\tilde{\psi}_{n\sigma^{\prime}}^{0}\rangle$ . It should be noted that this approach in principle requires convergence with respect to the number of scalar-relativistic states included in the basis, but the eigenvalues typically converge rather rapidly with respect to the basis. In Fig. 3, we show the band structure of a WS₂ monolayer obtained from PBE with non-self-consistent spin–orbit coupling. The W atoms introduce strong spin–orbit coupling in this material and the valence band is split by 0.45 eV at the K point. The spin degeneracy is retained along the $\Gamma$ -M line, which is left invariant by two non-commuting mirror symmetries.

For magnetic materials the non-selfconsistent treatment of spin–orbit coupling is convenient for evaluating the magnetic anisotropy. The magnetic force theorem [187] implies that rotating the magnetic moments away from the ground state configuration yields a contribution to the energy, which is well approximated by the change in Kohn–Sham eigenvalues. The change in energy for a given orientation of the magnetization density can thus be obtained as

\Delta E^{\theta,\varphi}=\sum_{n}f_{n}^{\theta,\varphi}\varepsilon_{n}^{% \theta,\varphi},

(30)

where $\varepsilon_{n}^{\theta,\varphi}$ are the eigenvalues obtained from diagonalizing Eq. (29) with the spins rotated to a direction defined by the angles $(\theta,\varphi)$ and $f_{n}^{\theta,\varphi}$ are the associated occupation numbers. This corresponds to rotating the xc-magnetic field (defined below), which will lead to different eigenvalues when spin–orbit coupling is included. In two-dimensional magnets, an easy-axis anisotropy is decisive for magnetic order [188, 189, 41] and Eq. (30) is easily applied to high throughput computations of magnetic properties [190, 191].

V.2 Self-consistent non-collinear magnetism

The Kohn–Sham framework for treating non-collinear magnetism was developed by Barth and Hedin [192] and involves the spin density matrix

\rho_{\sigma\sigma^{\prime}}(\mathbf{r})=\sum_{n}f_{n}\psi^{*}_{n\sigma}(% \mathbf{r})\psi_{n\sigma^{\prime}}(\mathbf{r})

(31)

as the basic variable. The electronic density and magnetization are then given by $n=\mathrm{Tr}[\rho]$ and $\mathbf{m}=\mathrm{Tr}[\boldsymbol{\sigma}\rho]$ respectively. The XC-potential acquires four components, which are the functional derivatives of the XC-energy with respect to the density matrix and may be represented as a $2\times 2$ matrix acting on spinor Kohn–Sham states. These can be expressed in terms of the density and magnetization, which lead to the XC-part of the Kohn–Sham Hamiltonian

H_{\mathrm{xc}}=v_{\mathrm{xc}}+\mathbf{B}_{\mathrm{xc}}\cdot\boldsymbol{% \sigma},

(32)

where the scalar potential and XC-magnetic field are given by

v_{\mathrm{xc}}(\mathbf{r})=\frac{\delta E_{\mathrm{xc}}}{\delta n(\mathbf{r})% }\qquad\mathbf{B}_{\mathrm{xc}}(\mathbf{r})=\frac{\delta E_{\mathrm{xc}}}{% \delta\mathbf{m}(\mathbf{r})}.

(33)

In GPAW, the self-consistent treatment of non-collinear spin is implemented within LDA where $\mathbf{B}_{\mathrm{xc}}$ is approximated as

\mathbf{B}_{\mathrm{xc}}^{\mathrm{LDA}}(\mathbf{r})=\frac{\delta E_{\mathrm{xc% }}^{\mathrm{LDA}}}{\delta m(\mathbf{r})}\mathbf{\hat{m}}(\mathbf{r}).

(34)

Here $m(\mathbf{r})=|\mathbf{m}(\mathbf{r})|$ is the magnitude and $\mathbf{\hat{m}}(\mathbf{r})$ is the direction of the magnetization. The generalization of Eq. 34 to GGAs are plagued by formal as well as numerical problems [193, 194] and the self-consistent solution of the non-collinear Kohn–Sham equations is presently restricted to LDA. It should be emphasized that the PAW formalism allows for a fully non-collinear treatment that does not rely on intra-atomic collinearity, which is often imposed in other electronic structure packages.

Spin–orbit coupling may be included by adding Eq. 28 to the Kohn–Sham Hamiltonian and this constitutes a fully self-consistent framework for spin–orbit calculations within LDA.

V.3 Orbital magnetization

Current ab initio methods for determining the orbital magnetization of a material involve either the modern theory, i.e. a Berry-phase formula, or the atom-centered approximation (ACA), where contributions to the expectation value of the angular momentum operator are restricted within atom-centered muffin-tin (MT) spheres with specified cutoff radii. The PAW formulation of the wave functions allows for an approximation similar to the MT-ACA where only the PAW expansion of the wave functions is assumed to contribute significantly to the expectation value of the angular momentum. The orbital magnetic moments can then be calculated through

\mathbf{m}^{a}_{\mathrm{orb}}=-\frac{e}{2m_{e}}\sum_{a}\sum_{ii^{\prime}}\sum_% {\sigma}D^{a}_{\sigma ii^{\prime}}\left<\phi^{a}_{i}\middle|\widehat{\mathbf{L% }}\middle|\phi^{a}_{i^{\prime}}\right>,

(35)

where $D^{a}_{\sigma ii^{\prime}}$ are elements of the atomic density matrix as defined in Eq. (9), $\phi^{a}_{i}(\mathbf{r})$ are the bound all-electron partial waves of atom $a$ , and $\hat{\mathbf{L}}$ is the angular-momentum operator.

We note that this expression does not entail cutoff radii for the atomic contributions to the orbital magnetic moments; instead, the entire all-electron partial waves are included although the all-electron atomic expansion is only formally exact inside the PAW spheres. Additionally, this means that the unbounded, i.e. non-normalisable, all-electron partial waves must be excluded in Eq. (35)

A prerequisite for nonzero orbital magnetization is broken time-reversal symmetry as represented by a complex Hamiltonian that is not unitarily equivalent to a real counterpart. Practically this means that a finite orbital magnetization requires magnetic order and either inclusion of spin–orbit coupling or non-coplanar spin textures. In GPAW, the spin–orbit interaction can be included either self-consistently in a non-collinear calculation or as a non-self-consistent post-processing step following a collinear calculation.

Crystal	bcc-Fe	fcc-Ni	fcc-Co	hcp-Co
Easy axis	$[001]$	$[111]$	$[111]$	$[001]$
PAW-ACA	$0.0611$	$0.0546$	$0.0845$	$0.0886$
MT-ACA [195]	$0.0433$	$0.0511$	$0.0634$	$0.0868$
Modern theory [195]	$0.0658$	$0.0519$	$0.0756$	$0.0957$
Experiment [196]	$0.081$	$0.053$	$0.120$	$0.133$

Table 2: Calculated and measured values for the orbital magnetization in units of

\mu_{\mathrm{B}}

per atom.

The orbital magnetization has been calculated using Eq. (35) for the simple ferromagnets bcc-Fe, fcc-Ni, fcc-Co, and hcp-Co, where the spin–orbit interaction is included non-self-consistently. The PAW-ACA results are displayed in Table 2 along with MT-ACA results, modern theory results, and measurements from experiment, demonstrating mainly that the PAW-ACA can be an improvement over the MT-ACA and secondly that there is decent agreement between the PAW-ACA and the modern theory for these systems.

V.4 Constant B-field

The coupling of the electronic spin magnetic moments to a constant external magnetic field $\mathbf{B}$ can be included by the addition of a Zeeman term in the Kohn–Sham Hamiltonian:

\Delta H_{\mathrm{B}}=\mathbf{B}\cdot\boldsymbol{\sigma}.

(36)

As an example, we can consider the spin-flop transition in Cr₂O₃. The ground state is anti-ferromagnetic with a weak anisotropy that prefers alignment of the spins with the $z$ -axis, and if a magnetic field is applied along the $z$ -direction, it becomes favorable to align the spins along a perpendicular direction (with a small ferromagnetic component) at the critical field where the Zeeman energy overcomes the anisotropy. This is clearly a spin–orbit effect and one has to carry out the calculations in a fully non-collinear framework with self-consistent spin–orbit coupling. In Fig. 4 we show the energy of the two alignments of spin as a function of external magnetic field obtained with LDA+U ( $U=2$ eV). The minimum energy configuration changes from $S_{z}$ to $S_{x}$ alignment at 6 T, i.e., the spin flips at this critical field value. This is in excellent agreement with the experimental value of Ref. [197]. The critical field is, however, rather strongly dependent on the chosen value of $U$ and increases by a factor of two in the absence of $U$ . The magnetic anisotropy (per unit cell) can be read off from the energy difference at $B=0$ T.

V.5 Spin spirals

The ground-state magnetic structure of frustrated and/or chiral magnets is often non-collinear and may be incommensurate with the chemical unit cell. In the classical isotropic Heisenberg model, the energy is always minimized by a planar spin spiral [198]. The energy of a general planar spin spiral can be evaluated efficiently within the chemical unit cell using the Generalized Bloch’s Theorem (GBT) [199]. In the GBT implementation of GPAW [40], there is no restriction on interatomic collinearity and thus it encodes a rotation of the all-electron magnetization density by an angle $\varphi=\mathbf{q}\cdot\mathbf{R}_{i}$ upon translations of a lattice vector $\mathbf{R}_{i}$ . The Kohn–Sham equations can then be solved self-consistently for a fixed wave vector $\mathbf{q}$ ,

\displaystyle\hat{H}_{\mathbf{q}}(\mathbf{k})\left|u_{\mathbf{q},n\mathbf{k}}% \right\rangle=\varepsilon_{\mathbf{q},n\mathbf{k}}\left|u_{\mathbf{q},n\mathbf% {k}}\right\rangle,

(37)

using only the periodic part of the generalized Bloch orbitals, $|\psi_{\mathbf{q},n\mathbf{k}}\rangle=U_{\mathbf{q}}^{\dagger}(\mathbf{r})e^{i% \mathbf{k}\cdot\mathbf{r}}|u_{\mathbf{q},n\mathbf{k}}\rangle$ , where $U_{\mathbf{q}}$ denotes the generator of spin rotations about the $z$ -axis:

\displaystyle U_{\mathbf{q}}(\mathbf{r})=\begin{pmatrix}e^{i\mathbf{q}\cdot% \mathbf{r}/2}&0\\ 0&e^{-i\mathbf{q}\cdot\mathbf{r}/2}\end{pmatrix}.

(38)

The generalized Bloch Hamiltonian without spin–orbit coupling is given by

\displaystyle\hat{H}_{\mathbf{q}}(\mathbf{k})

\displaystyle=e^{-i\mathbf{k}\cdot\mathbf{r}}U_{\mathbf{q}}(\mathbf{r})\hat{H}% U_{\mathbf{q}}^{\dagger}(\mathbf{r})e^{i\mathbf{k}\cdot\mathbf{r}},

(39)

and once Eq. (37) has been solved self-consistently for a given $\mathbf{q}$ , the corresponding spin-spiral energy is evaluated as usual, $E_{\mathrm{SS}}(\mathbf{q})=E_{\mathbf{q}}^{\mathrm{DFT}}[n,\mathbf{m}]$ .

Using the GPAW functionality to compute the spin-spiral energy as a function of $\mathbf{q}$ , one can then search for the ground-state wave vector $\mathbf{Q}$ that minimizes the energy. In Fig. 5, we show that the monolayer NiBr₂ has an incommensurate spin-spiral ground state, and that the local magnetic moment on the Ni atom depends only weakly on the wave vector $\mathbf{q}$ .

Furthermore, the orientation of the ground state spin spiral can be obtained by including spin–orbit coupling non-self-consistently in the projected spin–orbit approximation [200] and search for the orientation that minimizes the energy. The ordering vector $\mathbf{Q}$ and the normal vector $\mathbf{n}$ of the spiral plane thus constitute a complete specification of the magnetic ground state within the class of single- $\mathbf{q}$ states. The normal vector is of particular interest since spin spirals may lead to spontaneous breaking of the crystal symmetry, and the normal vector largely determines the direction of magnetically induced spontaneous polarization [40]. In Fig. 5, we show that $\mathbf{n}$ is perpendicular to the wavevector $\mathbf{Q}$ in the monolayer NiBr₂ ground state, corresponding to a cycloidal spin spiral.

VI Response functions and excitations

Linear response functions are the bread and butter of condensed matter physics. Their applications include the description of dielectric screening, optical and electron energy loss spectra, many-body excitations, and ground state correlation energies. In this section we describe the methods available in GPAW for calculating electronic response functions as well as GW quasiparticle band structures and optical excitations from the Bethe-Salpeter Equation (BSE). In addition to electronic response function, GPAW can also calculate the transverse magnetic susceptibility, which holds information about the magnetic excitations, e.g. magnons, and can be used to derive parameters for classical Heisenberg spin models and to estimate magnetic transition temperatures. Finally, we present methods to calculate Raman spectra of solids and quadratic optical response tensors for describing second harmonics generation and the Pockels electro-optical effect.

VI.1 Linear-response TDDFT

To linear order, the change in electron density induced by a time-dependent external (scalar) potential $V_{\mathrm{ext}}(\mathbf{r},t)$ is governed by the electronic susceptibility, $\chi$ :

\delta n(\mathbf{r},t)=\int_{-\infty}^{\infty}dt^{\prime}\int d\mathbf{r}^{% \prime}\,\chi(\mathbf{r},\mathbf{r}^{\prime},t-t^{\prime})V_{\mathrm{ext}}(% \mathbf{r}^{\prime},t^{\prime}).

(40)

The susceptibility is itself given by the Kubo formula [201]

\chi(\mathbf{r},\mathbf{r}^{\prime},t-t^{\prime})=-\frac{i}{\hbar}\theta(t-t^{% \prime})\langle\left[\hat{n}_{0}(\mathbf{r},t),\hat{n}_{0}(\mathbf{r}^{\prime}% ,t^{\prime})\right]\rangle_{0},

(41)

where the expectation value is taken with respect to the ground state at zero temperature and the density operators are cast in the interaction picture.

In the noninteracting Kohn–Sham system, the susceptibility can be evaluated explicitly from the Kohn–Sham orbitals, eigenvalues and occupations:

	$\displaystyle\chi_{0}(\mathbf{r},\mathbf{r}^{\prime},\omega)=$	$\displaystyle\lim_{\eta\rightarrow 0^{+}}2\,\sum_{n\mathbf{k}}\sum_{m\mathbf{k% }^{\prime}}\left(f_{n\mathbf{k}}-f_{m\mathbf{k}^{\prime}}\right)$
		$\displaystyle\times\frac{\psi^{}_{n\mathbf{k}}(\mathbf{r})\psi_{m\mathbf{k}^{% \prime}}(\mathbf{r})\psi^{}_{m\mathbf{k}^{\prime}}(\mathbf{r}^{\prime})\psi_{% n\mathbf{k}}(\mathbf{r}^{\prime})}{\hbar\omega-(\epsilon_{m\mathbf{k}^{\prime}% }-\epsilon_{n\mathbf{k}})+i\hbar\eta}.$		(42)

Based on the Kohn–Sham susceptibility, $\chi_{0}$ , the many-body susceptibility can be calculated via a Dyson-like equation [202]:

	$\displaystyle\chi(\mathbf{r},\mathbf{r}^{\prime},\omega)=\chi_{0}(\mathbf{r},% \mathbf{r}^{\prime},\omega)+$	$\displaystyle\iint d\mathbf{r}_{1}d\mathbf{r}_{2}\,\chi_{0}(\mathbf{r},\mathbf% {r}_{1},\omega)$
		$\displaystyle\times K_{\mathrm{Hxc}}(\mathbf{r}_{1},\mathbf{r}_{2},\omega)\chi% (\mathbf{r}_{2},\mathbf{r}^{\prime},\omega).$		(43)

Here, electronic interactions are accounted for via the Hartree-XC kernel, which is defined in terms of the effective potentials of time-dependent DFT (TDDFT) [203]:

K_{\mathrm{Hxc}}(\mathbf{r},\mathbf{r}^{\prime},t-t^{\prime})=v_{c}(\mathbf{r}% -\mathbf{r}^{\prime})+f_{\mathrm{xc}}(\mathbf{r},\mathbf{r}^{\prime},t-t^{% \prime})

(44)

where $v_{c}$ is the Coulomb interaction and $f_{\mathrm{xc}}$ is the XC-kernel defined as

f_{\mathrm{xc}}(\mathbf{r},\mathbf{r}^{\prime},t-t^{\prime})=\frac{\delta v_{% \mathrm{xc}}(\mathbf{r},t)}{\delta n(\mathbf{r}^{\prime},t^{\prime})}.

(45)

In prototypical linear-response TDDFT (LR-TDDFT) calculations, the exchange–correlation part of the kernel is either neglected (leading to the random phase approximation (RPA)) or approximated via the adiabatic local density approximation (ALDA),

v_{\mathrm{xc}}(\mathbf{r},t)\simeq\left.\frac{\partial[\epsilon_{\mathrm{xc}}% (n)n]}{\partial n}\right|_{n(\mathbf{r},t)},

(46)

where $\epsilon_{\mathrm{xc}}(n)$ denotes the XC-energy per electron of the homogeneous electron gas of density $n$ . The ALDA is in fact rather restricted as it only ensures a correct description of the kernel for metals in the long wave length limit. This implies that it cannot account for excitons in extended systems and furthermore it leads to a divergent XC-hole. The latter problem can be resolved by a simple renormalization of the ALDA, that regulates the ontop XC-hole and drastically improves the description of local correlations over the ALDA (and RPA) [37].

VI.1.1 Implementation for periodic systems

In crystalline systems, the susceptibility is periodic with respect to translations on the Bravais lattice, $\chi(\mathbf{r}+\mathbf{R},\mathbf{r}^{\prime}+\mathbf{R},\omega)=\chi(\mathbf% {r},\mathbf{r}^{\prime},\omega)$ . Consequently, the susceptibility can be Fourier-transformed according to

\chi_{\mathbf{G}\mathbf{G}^{\prime}}(\mathbf{q},\omega)=\iint\frac{d\mathbf{r}% d\mathbf{r}^{\prime}}{\Omega}e^{-i(\mathbf{G}+\mathbf{q})\cdot\mathbf{r}}\chi(% \mathbf{r},\mathbf{r}^{\prime},\omega)e^{i(\mathbf{G}^{\prime}+\mathbf{q})% \cdot\mathbf{r}^{\prime}},

(47)

translating the Dyson equation (43) into a plane-wave matrix equation, which is diagonal in the wave vector $\mathbf{q}$ and can be inverted numerically.

By using a plane-wave representation, the crux of the implementation then becomes to calculate the reciprocal-space pair densities

n_{n\mathbf{k},m\mathbf{k}+\mathbf{q}}(\mathbf{G}+\mathbf{q})=\langle\psi_{n% \mathbf{k}}|e^{-i(\mathbf{G}+\mathbf{q})\cdot\mathbf{r}}|\psi_{m\mathbf{k}+% \mathbf{q}}\rangle,

(48)

and to Fourier transform the XC-kernel

f_{\mathrm{LDA}}(\mathbf{G})=\int_{\Omega_{\mathrm{cell}}}d\mathbf{r}\,e^{-i% \mathbf{G}\cdot\mathbf{r}}\left.\frac{\partial^{2}[\epsilon_{\mathrm{xc}}(n)n]% }{\partial n^{2}}\right|_{n(\mathbf{r})}.

(49)

Crucially, both can be evaluated by adding a PAW correction to the analogous pseudo quantity, which itself can be evaluated by means of a fast Fourier transform (FFT). Implementational details hereto are reported in Refs. [43] and [36].

The Hartree contribution to the kernel (44) is simply given by the bare Coulomb interaction (here given in atomic units)

v_{\mathrm{c},\mathbf{G}\mathbf{G}^{\prime}}(\mathbf{q})=\frac{4\pi}{|\mathbf{% G}+\mathbf{q}|^{2}}\delta_{\mathbf{G},\mathbf{G}^{\prime}},

(50)

and can be evaluated analytically when $\mathbf{q}$ is finite. However, in the optical limit, the $\mathbf{G}=\mathbf{0}$ component diverges and one needs to be careful when inverting the Dyson equation (43). In GPAW, this is handled by expanding the pair densities within $kp$ -perturbation theory:

\displaystyle\langle\psi_{n\mathbf{k}}|e^{-i(\mathbf{G}+\mathbf{q})\cdot% \mathbf{r}}|\psi_{m\mathbf{k}+\mathbf{q}}\rangle_{\mathbf{q}\rightarrow\mathbf% {0},\mathbf{G}=\mathbf{0}}=-i\mathbf{q}\cdot\frac{\langle\psi_{n\mathbf{k}}|% \boldsymbol{\nabla}|\psi_{m\mathbf{k}}\rangle}{\epsilon_{m\mathbf{k}}-\epsilon% _{n\mathbf{k}}}.

(51)

In the expansion, the diverging Coulomb term is exactly cancelled by the $\mathbf{q}$ -dependence of the pair densities, so that the product $\chi_{0}v_{\mathrm{c}}$ remains finite. For additional details, see Ref. [36].

VI.1.2 Spectral representation

GPAW offers two different ways of dealing with the frequency dependence of the Kohn–Sham susceptibility (42). One is to evaluate the expression explicitly for the frequencies of interest. This is advantageous if one is interested in a few specific frequencies. The other is to evaluate the associated spectral function

$\displaystyle S_{0,\mathbf{G}\mathbf{G}^{\prime}}(\mathbf{q},\omega)=$	$\displaystyle\frac{2}{\Omega}\sum_{\mathbf{k}}\sum_{n,m}(f_{n\mathbf{k}}-f_{m% \mathbf{k}+\mathbf{q}})$
	$\displaystyle\times n_{n\mathbf{k},m\mathbf{k}+\mathbf{q}}(\mathbf{G}+\mathbf{% q})n_{m\mathbf{k}+\mathbf{q},n\mathbf{k}}(-\mathbf{G}^{\prime}-\mathbf{q})$
	$\displaystyle\times\delta\left(\hbar\omega-\left[\epsilon_{m\mathbf{k}+\mathbf% {q}}-\epsilon_{n\mathbf{k}}\right]\right)$	(52)

from which the susceptibility can be obtained via a Hilbert transform,

\displaystyle\chi_{0,\mathbf{G}\mathbf{G}^{\prime}}(\mathbf{q},\omega)=\lim_{% \eta\rightarrow 0^{+}}\int_{-\infty}^{\infty}d\omega^{\prime}\,\frac{S_{0,% \mathbf{G}\mathbf{G}^{\prime}}(\mathbf{q},\omega^{\prime})}{\omega^{\prime}-% \omega+i\eta}.

(53)

To converge the Hilbert transform, the spectral function is evaluated on a nonlinear frequency grid spanning the range of eigenenergy differences included in Eq. (52). Although the calculation becomes more memory-intensive as a result, it is usually faster to compute $\chi_{0}$ via its spectral function. Furthermore, since $S_{0}$ is linearly interpolatable, the tetrahedron method can be employed to improve the convergence with respect to $\mathbf{k}$ -points.

VI.2 Dielectric function

The longitudinal part of the dielectric tensor is related to the susceptibility $\chi$ as

\displaystyle\varepsilon^{-1}_{\mathbf{G}\mathbf{G}^{\prime}}(\mathbf{q},% \omega)=\delta_{\mathbf{G}\mathbf{G}^{\prime}}+v_{c,\mathbf{G}\mathbf{G}^{% \prime}}(\mathbf{q})\chi_{\mathbf{G}\mathbf{G}^{\prime}}(\mathbf{q},\omega).

(54)

From the dielectric matrix, the macroscopic dielectric function, including local field corrections, is given by

\displaystyle\varepsilon_{M}(\mathbf{q},\omega)=\frac{1}{\varepsilon^{-1}_{00}% (\mathbf{q},\omega)}

(55)

from which it is straightforward to extract the optical absorption spectrum

\displaystyle\mathrm{ABS}=\mathrm{Im}\>\varepsilon_{M}(\mathbf{q}\rightarrow 0% ,\omega)

(56)

as well as the electron energy-loss spectrum

\displaystyle\mathrm{EELS}=-\mathrm{Im}\>\frac{1}{\varepsilon_{M}(\mathbf{q},% \omega)}.

(57)

It is also possible to define a symmetrized version of the dielectric matrix as

\displaystyle\tilde{\varepsilon}_{\mathbf{GG}^{\prime}}(\mathbf{q},\omega)=v^{% -1/2}_{\mathbf{G}}(\mathbf{q})\varepsilon_{\mathbf{GG}^{\prime}}(\omega,% \mathbf{q})v^{1/2}_{\mathbf{G}^{\prime}}(\mathbf{q}).

(58)

The susceptibility can be evaluated at the RPA level (by setting $V_{\text{xc}}$ in Eq. 43 to zero) or using one of the XC-kernels implemented in GPAW such as the the local ALDA, the non-local rALDA [37], or the bootstrap kernel [204].

The evaluation $\chi_{0}$ is computationally demanding since it involves an integration over the Brillouin zone (BZ) as well as a summation over occupied and unoccupied states. The $\mathbf{k}$ -point convergence can be increased substantially with the tetrahedron method [205]. Contrary to the simple point integration, where the $\delta$ -function in Eq. 52 is replaced by a smeared-out Lorentzian, the tetrahedron method utilizes linear interpolation of the eigenvalues and matrix elements.

In Fig. 6 we compare the dielectric function computed using the two different integration methods for two prototypical cases: (semiconducting) Si and (metallic) Ag. Since Si is semiconducting there are no low-energy excitations and consequently the imaginary part of the dielectric function is zero while the real part is flat for low frequencies. The point integration and tetrahedron integration yield the same value for $\omega\rightarrow 0$ , but the tetrahedron integration avoids the unphysical oscillations of $\varepsilon$ at higher frequencies, exhibited by the point integration due to the finite $\mathbf{k}$ -point sampling. For metals, the $\mathbf{k}$ -point convergence is even slower and the difference between the point integration and tetrahedron integration is thus even more pronounced for Ag. By increasing the $\mathbf{k}$ -point sampling, the point integration results will eventually approach the results obtained with the tetrahedron method, but at a much higher computational cost.

VI.2.1 Screening in low-dimensional systems

In a three dimensional (3D) bulk crystal, the macroscopic dielectric function is related to the macroscopic polarizability $\alpha_{M}$ of the material as

\displaystyle\varepsilon_{M}(\mathbf{q},\omega)=1+4\pi\alpha_{M}(\mathbf{q},% \omega).

(59)

Since $\varepsilon_{M}$ is related to the macroscopic average of the induced potential, it will depend on the unit cell in low-dimensional systems and tend to unity when increasing the cell in the non-periodic direction. In contrast, it is straighforward to generalize the definition of the polarizability to such that it measures the induced dipole moment per length or area rather than volume. The $d$ -dimensional polarizability is thus defined as $\alpha_{M}^{d}=\Omega^{d}\alpha_{M}$ where $\Omega^{d}$ is the cell volume with the periodic directions projected out. For example, for a 2D material $\Omega^{d}$ is simply the length of the unit cell in the non-periodic direction. To improve convergence with respect to the size of the unit cell, the Coulomb kernel is truncated using the reciprocal-space method introduced in Ref. [206]. This enables efficient calculations of dielectric properties of low-dimensional materials materials [207, 208]. This does, however, imply that the polarizability cannot be evaluated directly from the dielectric constant (which is just one), but one may obtain it directly from the susceptibility as

\displaystyle\alpha_{M}=-\lim\limits_{\mathbf{q}\to 0}\frac{\Omega^{d}}{q^{2}}% \chi_{00}(\mathbf{q}).

(60)

VI.3 Adiabatic-connection fluctuation-dissipation theorem

The adiabatic-connection fluctuation-dissipation theorem (ACFDT) is a highly promising approach for constructing accurate correlation functionals with non-local characteristics. Unlike regular XC-functionals, the ACFDT correlation functionals do not rely on error cancellation between exchange and correlation. Instead, the ACFDT provides an exact theory for the electronic correlation energy $E_{\text{c}}$ in terms of the interacting electronic suseptibility, which can be combined with the exact exchange:

E_{\text{c}}=-\int_{0}^{\infty}\frac{\mathrm{d}\omega}{2\pi}\int_{0}^{1}% \mathrm{d}\lambda\mathrm{Tr}\left[v_{\text{c}}\chi^{\lambda}\left(w\right)-v_{% \text{c}}\chi^{\text{KS}}(w)\right].

(61)

The response function can be expressed in terms of the Kohn–Sham response function and the exchange–correlation kernel $f_{\text{xc}}$

\chi^{\lambda}(\omega)=\chi^{\text{KS}}(\omega)+\chi^{\text{KS}}(\omega)\left[% \lambda v_{c}+f_{\text{xc}}(\omega)\right]\chi^{\lambda}(\omega).

(62)

The random phase approximation (RPA) can be derived from ACFDT if the XC-kernel $f_{\text{xc}}$ is neglected. RPA has the strength of capturing non-local correlation effects and provides high accuracy across different bonding types including van der Waals interaction [209, 210, 29, 211].

Simple exchange–correlation kernels can also be incorporated into the response function, such as the adiabatic LDA (ALDA) kernel. However, the locality of adiabatic kernels leads to divergent characteristics of the pair-correlation function [212]. This issue can be overcome by a renormalization (r) scheme [37], which is implemented in GPAW as rALDA. This class of renormalized kernels provides a significantly better description of the short-range correlations and hence also yield highly accurate total correlation energies [213, 214, 215, 216].

VI.4 Magnetic response

The LR-TDDFT framework described in Sec. VI.1 can be generalized to include spin degrees of freedom [43, 217]. Using the four-component density as the basic variable, $n^{\mu}(\mathbf{r})$ (where $\mu\in\{0,x,y,z\}$ ), one can define the four-component susceptibility tensor:

\chi^{\mu\nu}(\mathbf{r},\mathbf{r}^{\prime},t-t^{\prime})=-\frac{i}{\hbar}% \theta(t-t^{\prime})\langle\left[\hat{n}^{\mu}_{0}(\mathbf{r},t),\hat{n}^{\nu}% _{0}(\mathbf{r}^{\prime},t^{\prime})\right]\rangle_{0}.

(63)

In a similar fashion to Eq. (43), the many-body $\chi^{\mu\nu}$ can be calculated from the corresponding susceptibility tensor of the Kohn–Sham system, $\chi^{\mu\nu}_{\mathrm{KS}}$ . In the most general case, the Dyson equation for $\chi^{\mu\nu}$ is a matrix equation in the $\mu$ and $\nu$ indices, explicitly coupling the charge and spin degrees of freedom. However, for collinear magnetic systems, the transverse components of the susceptibility decouple from the rest in the absence of spin–orbit coupling.

VI.4.1 Transverse magnetic susceptibility

Taking the spins to be polarized along the $z$ -direction, the transverse magnetic susceptibility of collinear nonrelativistic systems may be expressed in terms of the spin-raising and spin-lowering density operators $\hat{n}^{+}(\mathbf{r})=(\hat{n}^{x}(\mathbf{r})+i\hat{n}^{y}(\mathbf{r}))/2=% \hat{\psi}^{\dagger}_{\uparrow}(\mathbf{r})\hat{\psi}_{\downarrow}(\mathbf{r})$ and $\hat{n}^{-}(\mathbf{r})=(\hat{n}^{x}(\mathbf{r})-i\hat{n}^{y}(\mathbf{r}))/2=% \hat{\psi}^{\dagger}_{\downarrow}(\mathbf{r})\hat{\psi}_{\uparrow}(\mathbf{r})$ . In the ALDA, the Dyson equation for $\chi^{+-}$ then becomes a scalar one [43],

	$\displaystyle\chi^{+-}(\mathbf{r},\mathbf{r}^{\prime},\omega)=$	$\displaystyle\chi^{+-}_{\mathrm{KS}}(\mathbf{r},\mathbf{r}^{\prime},\omega)+% \int\mathrm{d}\mathbf{r}_{1}$
		$\displaystyle\times\chi^{+-}_{\mathrm{KS}}(\mathbf{r},\mathbf{r}_{1},\omega)f^% {-+}_{\mathrm{LDA}}(\mathbf{r}_{1})\chi^{+-}(\mathbf{r}_{1},\mathbf{r}^{\prime% },\omega),$		(64)

where the transverse XC-kernel is given by $f^{-+}_{\mathrm{LDA}}=2W_{\mathrm{LDA}}(\mathbf{r})/n^{z}(\mathbf{r})$ . The susceptibilities $\chi^{+-}$ and $\chi^{+-}_{\mathrm{KS}}$ are themselves defined via the Kubo formula (63), where the latter can be evaluated explicitly in complete analogy to Eq. (42):

	$\displaystyle\chi^{+-}_{\mathrm{KS}}(\mathbf{r},\mathbf{r}^{\prime},\omega)=$	$\displaystyle\lim_{\eta\rightarrow 0^{+}}\sum_{n\mathbf{k}}\sum_{m\mathbf{k}^{% \prime}}\left(f_{n\mathbf{k}\uparrow}-f_{m\mathbf{k}^{\prime}\downarrow}\right)$
		$\displaystyle\times\frac{\psi^{}_{n\mathbf{k}\uparrow}(\mathbf{r})\psi_{m% \mathbf{k}^{\prime}\downarrow}(\mathbf{r})\psi^{}_{m\mathbf{k}^{\prime}% \downarrow}(\mathbf{r}^{\prime})\psi_{n\mathbf{k}\uparrow}(\mathbf{r}^{\prime}% )}{\hbar\omega-(\epsilon_{m\mathbf{k}^{\prime}\downarrow}-\epsilon_{n\mathbf{k% }\uparrow})+i\hbar\eta}.$		(65)

In GPAW, the transverse magnetic susceptibility is calculated using a plane-wave basis as described in Sec. VI.1.1, with the notable exception that no special care is needed to treat the optical limit, since the Hartree kernel plays no role in the Dyson equation (64). In terms of the temporal representation, it is at the time of this writing only possible to do a literal evaluation of Eq. (65) at the frequencies of interest. For metals, this means that $\eta$ has to be left as a finite broadening parameter, which has to be carefully chosen depending on the $\mathbf{k}$ -point sampling, see [43].

VI.4.2 The spectrum of transverse magnetic excitations

Based on the transverse magnetic susceptibility, one may calculate the corresponding spectral function [43]

\displaystyle S^{+-}(\mathbf{r},\mathbf{r}^{\prime},\omega)

\displaystyle=-\frac{1}{2\pi i}\left[\chi^{+-}(\mathbf{r},\mathbf{r}^{\prime},% \omega)-\chi^{-+}(\mathbf{r}^{\prime},\mathbf{r},-\omega)\right],

(66)

which directly governs the energy dissipation in a collinear magnet relating to induced changes in the spin projection along the $z$ -axis $S_{z}$ . In particular, one can decompose the spectrum into contributions from majority and minority spin excitations, $S^{+-}(\mathbf{r},\mathbf{r}^{\prime},\omega)=A^{+-}(\mathbf{r},\mathbf{r}^{% \prime},\omega)-A^{-+}(\mathbf{r}^{\prime},\mathbf{r},-\omega)$ , that is, into spectral functions for the excited states where $S_{z}$ has been lowered or raised by one unit of spin angular momentum respectively:

	$\displaystyle A^{+-}(\mathbf{r},\mathbf{r}^{\prime},\omega)=\sum_{\alpha>0}$	$\displaystyle\langle 0\|\hat{n}^{+}(\mathbf{r})\|\alpha\rangle\langle\alpha\|\hat% {n}^{-}(\mathbf{r}^{\prime})\|0\rangle$
		$\displaystyle\times\delta\left(\hbar\omega-\left[E_{\alpha}-E_{0}\right]\right).$		(67)

Here, $\alpha$ iterates the system eigenstates with $\alpha=0$ denoting the ground state.

For collinear magnetic systems, the spectrum $S^{+-}$ is composed of two types of excitations: collective spin-wave excitations (referred to as magnons) and excitations in the Stoner-pair continuum (electron–hole pairs of opposite spin). Since GPAW employs a plane-wave representation of the spectrum, $S^{+-}_{\mathbf{G}\mathbf{G}^{\prime}}(\mathbf{q},\omega)$ , one can directly compare the calculational output to the inelastic neutron scattering cross-section measured in experiments [218]. In particular, one can extract the magnon dispersion directly by identifying the position of the peaks in the spectrum diagonal, see Fig. 7. In this way, GPAW allows the user to study various magnon phenomena in magnetic systems of arbitrary collinear order, such as nonanalytic dispersion effects in itinerant ferromagnets [43], correlation-driven magnetic phase transitions [219] and emergence of distinct collective modes inside the Stoner continuum of an antiferromagnet [220].

Additionally, one can analyze the spectrum more intricately by extracting the majority and minority eigenmodes from $S^{+-}$ as the positive and negative eigenvalues respectively. Contrary to analysis of the plane-wave diagonal, this makes it possible to completely separate the analysis of each individual magnon lineshape as well as the many-body Stoner continuum, see Fig. 7.

VI.4.3 Liechtenstein MFT

Not only can the transverse magnetic susceptibility be used to study magnetic excitations in a literal sense, but one can also use it to map the spin-degrees of freedom to a classical Heisenberg model,

E_{\mathrm{H}}=-\frac{1}{2}\sum_{i,j}\sum_{a,b}J_{ij}^{ab}\,\mathbf{u}_{ia}% \cdot\mathbf{u}_{jb},

(68)

where $i,j$ and $a,b$ denote the indices of the Bravais lattice and the magnetic sublattice, respectively, $\mathbf{u}_{ia}$ being the direction of spin-polarization of the given magnetic site. Based on the magnetic force theorem (MFT), the LSDA Heisenberg exchange parameters $J_{ij}^{ab}$ can be calculated from the reactive part of the static Kohn–Sham susceptibility [42] and the effective magnetic field $B_{\mathrm{xc}}(\mathbf{r})=\delta E_{\mathrm{xc}}/\delta m(\mathbf{r})$ , using the well-known Liechtenstein MFT formula [187]:

J_{ij}^{ab}=-2\int_{\Omega_{ia}}\mathrm{d}\mathbf{r}\int_{\Omega_{jb}}\mathrm{% d}\mathbf{r}^{\prime}\,B_{\mathrm{xc}}(\mathbf{r})\chi^{\prime+-}_{\mathrm{KS}% }(\mathbf{r},\mathbf{r}^{\prime})B_{\mathrm{xc}}(\mathbf{r}^{\prime}).

(69)

Here $\Omega_{ia}$ denotes the site volume, which effectively defines the Heisenberg model (68).

Using GPAW’s plane-wave representation of $\chi^{\prime+-}_{\mathrm{KS}}$ , one can directly compute the lattice Fourier-transformed exchange parameters [42],


$\displaystyle\bar{J}^{ab}(\mathbf{q})$	$\displaystyle=\sum_{i}J_{0i}^{ab}e^{i\mathbf{q}\cdot\mathbf{R}_{i}}$	(70a)
	$\displaystyle=-\frac{2}{\Omega_{\mathrm{cell}}}B_{\mathrm{xc}}^{\dagger}K^{a% \dagger}(\mathbf{q})\chi^{\prime+-}_{\mathrm{KS}}(\mathbf{q})K^{b}(\mathbf{q})% B_{\mathrm{xc}},$	(70b)

where the right-hand side of the second equality is written in a plane-wave basis and $K_{a}$ denotes the sublattice site-kernel:

K^{a}_{\mathbf{G}\mathbf{G}^{\prime}}(\mathbf{q})=\frac{1}{\Omega_{\mathrm{% cell}}}\int_{\Omega_{0a}}\mathrm{}\mathrm{d}\mathbf{r}\,e^{-i(\mathbf{G}-% \mathbf{G}^{\prime}+\mathbf{q})\cdot\mathbf{r}}.

(71)

Since an a priori definition for the magnetic site volumes does not exist, GPAW supplies functionality to calculate exchange parameters based on spherical, cylindrical and/or parallelepipedic site configurations of variable size.

Upon calculation of the exchange parameters $\bar{J}^{ab}(\mathbf{q})$ , it is straight-forward to compute the magnon dispersion within the classical Heisenberg model using linear spin-wave theory and to estimate thermal quantities such as the Curie temperature, see e.g. [42]. In Fig. 8 the MFT magnon dispersion of hcp-Co is compared to majority magnon spectrum calculated within LR-TDDFT. For Co, the two are in excellent agreement except for the dispersion of the optical magnon mode along the $\mathrm{K}-\mathrm{M}-\Gamma$ high-symmetry path, where MFT underestimates the magnon frequency and neglects the fine structure of the spectrum. The fine structure of the ALDA spectrum appears due to the overlap between the magnon mode and the Stoner continuum. This gives rise to so-called Kohn anomalies (nonanalytical points in the magnon dispersion), which is a trademark of itinerant electron magnetism. Since the itinerancy is largely ignored in a localized spin model such as (68), one cannot generally expect to capture such effects.

VI.5 GW approximation

GPAW supports standard G₀W₀ quasiparticle (QP) calculations based on a first-order perturbative treatment of the linearized QP equation [9]

\displaystyle E_{n\mathbf{k}\sigma}^{\text{QP}}=\varepsilon_{n\mathbf{k}\sigma% }+Z_{n\mathbf{k}\sigma}\langle\psi_{n\mathbf{k}\sigma}|\Sigma_{\mathrm{GW}}(% \varepsilon_{n\mathbf{k}\sigma})+v_{\text{x}}-v_{\text{xc}}|\psi_{n\mathbf{k}% \sigma}\rangle,

(72)

where $\varepsilon_{n\mathbf{k}\sigma}$ and $\psi_{n\mathbf{k}\sigma}$ are Kohn–Sham eigenvalues and wave functions, and $v_{\text{xc}}$ and $v_{\text{x}}$ are the local XC-potential and nonlocal exchange potential, respectively. $\Sigma_{\mathrm{GW}}$ is the (dynamical part of) the GW self-energy whose frequency-dependence is accounted to first order by the renormalization factor $Z_{n\mathbf{k}\sigma}=(1-\mathrm{Re}\>\Sigma_{\mathrm{GW}}^{\prime}(% \varepsilon_{n\mathbf{k}\sigma}))^{-1}$ . As indicated by the $\sigma$ -index, spin-polarized G₀W₀ calculations are supported.

The GW self-energy is calculated in a plane-wave basis using full frequency integration along the real axis to evaluate the convolution between $G$ and $W$ [38]. Compared to alternative schemes employing contour deformation techniques or analytical continuation [221, 222, 223], this approach is time- consuming but numerically accurate and can provide the full spectral function. A highly efficient and accurate evaluation of the self-energy based on a multipole expansion of the screened interaction, $W$ [224], is currently being implemented, and a GPU version of the full GW code is under development.

An important technical issue concerns the treatment of the head and wings of $W$ ( $\mathbf{G}=0$ and/or $\mathbf{G}^{\prime}=0$ , respectively) in the $\mathbf{q}\rightarrow 0$ limit. The divergence of the Coulomb interaction appears both in the evaluation of the inverse dielectric matrix $\varepsilon^{-1}_{\mathbf{GG}^{\prime}}$ and in the subsequent evaluation of the screened interaction

\displaystyle W_{\mathbf{GG}^{\prime}}(\mathbf{q},\omega)=\sum_{\mathbf{G}^{% \prime\prime}}\varepsilon^{-1}_{\mathbf{GG}^{\prime\prime}}(\mathbf{q},\omega)% v_{c,\mathbf{G}^{\prime\prime}\mathbf{G}^{\prime}}(\mathbf{q}).

For 3D bulk crystals, GPAW obtains these components by evaluating $\varepsilon^{-1}_{\mathbf{GG}^{\prime}}$ on a dense $\mathbf{k}$ -grid centered at $\mathbf{k}=0$ while $v_{c,\mathbf{G}\mathbf{G}^{\prime}}$ can be integrated numerically or analytically around the $\Gamma$ -point.

For low-dimensional structures, in particular atomically thin 2D semiconductors, GPAW can use a truncated Coulomb interaction to avoid interactions between periodical images when evaluating $W$ [206]. It has been shown that the use of a truncated Coulomb kernel leads to slower $\mathbf{k}$ -point convergence [207]. To mitigate this drawback, a special 2D treatment of $W(q)$ at $q=0$ , can be applied to significantly improve the $\mathbf{k}$ -point convergence [225]. A detailed account of the GW implementation in GPAW can be found in Ref. [38].

Figure 9 shows two matrix elements of the dynamical GW self-energy for the valence and conduction band states at the Gamma point. As can be seen, the agreement with the corresponding quantities obtained with the Yambo GW code [226] is striking.

VI.6 Bethe–Salpeter Equation (BSE)

In addition to the LR-TDDFT discussed in Sec. VI.1, the interacting response function may be approximated by solving the Bethe–Salpeter equation (BSE) [11]. In particular, for a certain wave vector $\mathbf{q}$ , one may obtain the two-particle excitations by diagonalizing the Hamiltonian

	$\displaystyle H_{\begin{subarray}{c}\mathbf{k}_{2}m_{1}m_{2}\\ \mathbf{k}_{4}m_{3}m_{4}\end{subarray}}(\mathbf{q})$	$\displaystyle=\delta_{m_{1}m_{3}}\delta_{m_{2}m_{4}}\delta_{\mathbf{k}_{2}% \mathbf{k}_{4}}(\varepsilon_{m_{1}\mathbf{k}_{2}+\mathbf{q}}-\varepsilon_{m_{2% }\mathbf{k}_{2}})$		(73)
		$\displaystyle-(f_{m_{1}\mathbf{k}_{2}+\mathbf{q}}-f_{m_{2}\mathbf{k}_{2}})K_{m% _{2}\mathbf{k}_{2}m_{1}\mathbf{k}_{2}+\mathbf{q}m_{3}\mathbf{k}_{4}+\mathbf{q}% m_{4}\mathbf{k}_{4}},$

where $\varepsilon_{\mathbf{k}m}$ are the Kohn–Sham eigenvalues and $f_{\mathbf{k}m}$ the associated occupation numbers. The kernel is defined by $K=v_{\mathrm{c}}-W$ with

	$\displaystyle v_{c}=$	$\displaystyle\langle m_{2}\mathbf{k}_{2},m_{3}\mathbf{k}_{4}+\mathbf{q}\|\hat{v% }_{\mathrm{c}}\|m_{1}\mathbf{k}_{2}+\mathbf{q},m_{4}\mathbf{k}_{4}\rangle$		(74)
	$\displaystyle W=$	$\displaystyle\langle m_{2}\mathbf{k}_{2},m_{3}\mathbf{k}_{4}+\mathbf{q}\|\hat{W% }\|m_{4}\mathbf{k}_{4},m_{1}\mathbf{k}_{2}+\mathbf{q}\rangle,$		(75)

where $\hat{W}=\varepsilon^{-1}v_{\mathrm{c}}$ is the static screened Coulomb interaction. The matrix elements of the kernel are evaluated in a plane-wave basis where they are easily expressed in terms of the pair densities (48) and the reciprocal-space representation of the dielectric matrix (54).

In the Tamm–Dancoff approximation, one only includes states with $\varepsilon_{m_{1}\mathbf{k}_{2}+\mathbf{q}}-\varepsilon_{m_{2}\mathbf{k}_{2}}>0$ in Eq. (73) and the BSE Hamiltonian becomes Hermitian [11]. The interacting retarded response function may then be written as

\displaystyle\chi^{\mathbf{G}\mathbf{G}^{\prime}}(\mathbf{q},\omega)=\frac{1}{% \Omega}\sum_{\lambda}\frac{B_{\lambda}(\mathbf{q},\mathbf{G})C_{\lambda}^{*}(% \mathbf{q},\mathbf{G}^{\prime})}{\omega-E_{\lambda}(\mathbf{q})+i\eta},

(76)

where

	$\displaystyle B_{\lambda}(\mathbf{q},\mathbf{G})=\sum_{\mathbf{k}m_{1}m_{2}}n_% {m_{2}\mathbf{k}m_{1}\mathbf{k}+\mathbf{q}}(\mathbf{G}+\mathbf{q})A^{\lambda}_% {m_{1}m_{2}\mathbf{k}}(\mathbf{q}),$		(77)
	$\displaystyle C_{\lambda}(\mathbf{q},\mathbf{G})=\sum_{\mathbf{k}m_{1}m_{2}}n_% {m_{2}\mathbf{k}m_{1}\mathbf{k}+\mathbf{q}}(\mathbf{G}+\mathbf{q})A^{\lambda}_% {m_{1}m_{2}\mathbf{k}}(\mathbf{q})$
	$\displaystyle\qquad\qquad\qquad\qquad\times(f_{m_{1}\mathbf{k+q}}-f_{m_{2}% \mathbf{k}}),$		(78)

and $E_{\lambda}(\mathbf{q})$ denotes the eigenvalue of the Hamiltonian (73) corresponding to the eigenvector $A^{\lambda}_{m_{1}m_{2}\mathbf{k}}(\mathbf{q})$ .

In GPAW, the construction of the BSE Hamiltonian (73) proceeds in two steps [39]. First, the static screened interaction is calculated at all inequivalent $\mathbf{q}$ -points in a plane-wave basis and the Kernel is then subsequently expressed in a basis of two-particle KS states. The first step is efficiently parallelized over either states or $\mathbf{k}$ -points and the second step is parallelized over pair densities. The Hamiltonian elements are thus distributed over all CPUs and the diagonalization is carried out using ScaLAPACK such that the full Hamiltonian is never collected on a single CPU. The dimension of the BSE Hamiltonian and memory requirements are, therefore, only limited by the number of CPUs used for the calculation. We note that the implementation is not limited to the Tamm–Dancoff approximation, but calculations become more demanding without it. The response function may be calculated for spin-paired as well as spin-polarized systems and spin–orbit coupling can be included non-selfconsistently [227, 45]. In low-dimensional systems it is important to eliminate the spurious screening from periodic images of the structure, which is accomplished with the truncated Coulomb interaction of Ref. [206].

The most important application of BSE is arguably the calculation of optical absorption spectra of solids where BSE provides an accurate account of the excitonic effects that are not captured by semi-local approximations for $K_{\mathrm{Hxc}}$ (44). In 2D systems, the excitonic effects are particularly pronounced due to inefficient screening [207, 227] and in Fig. 10 we show the 2D polarizability of WS₂ calculated from BSE with $\mathbf{q}=\mathbf{0}$ . Comparing with Fig. 3, it is observed that the absorption edge is expected to be located at the $K$ point, where spin–orbit coupling splits the highest valence band by 0.45 eV. This splitting is seen as two excitonic peaks below the band gap, which is interpreted as distinct excitons originating from the highest and next-highest valence bands (the splitting of the lowest conduction band is negligible in this regard). For comparison, we also show the RPA polarizabilty obtained with the BSE module by neglecting the screened interaction in the kernel and this shows the expected absorption edge at the band gap. This yields identical results to the Dyson equation approach of Sec. VI.1, but has the advantage that the eigenvalues and weights of Eq. (76) are obtained directly such that the artificial broadening $\eta$ may be varied without additional computational cost. The eigenstate decomposition also allows one to access “dark states” and the BSE calculation reveals two (one for each valley) triplet-like excitons that are situated 70 meV below the lowest bright exciton in Fig. 10.

In addition to optical properties, GPAW allows for solving the BSE at finite $\mathbf{q}$ , which can used to obtain plasmon dispersion relations from the EELS (57) and magnon dispersions from the transverse magnetic susceptibility of Sec. VI.4 [228].

VI.7 Electron–phonon coupling

The electron-phonon coupling is the origin of several important materials properties, ranging from electrical and thermal conductivity to superconductivity. In addition it provides access to the deformation potential, which can be used to obtain transport properties for electrons and holes in solids [229].

The first-order electron–phonon coupling matrix $g_{mn}^{\nu}(\mathbf{k},\mathbf{q})$ measures the strength of the coupling between a phonon branch $\nu$ with wave vector $\mathbf{q}$ and frequency $\omega_{\nu}$ and the electronic states $m(\mathbf{k}+\mathbf{q})$ and $n(\mathbf{k})$ [230, 231]:

g_{mn}^{\nu}(\mathbf{k},\mathbf{q})=\sqrt{\frac{\hbar}{2m_{0}\omega_{\nu}}}M_{% mn}^{\nu}(\mathbf{k},\mathbf{q})

(79)

with

M_{mn}^{\nu}(\mathbf{k},\mathbf{q})=\langle\psi_{m\mathbf{k}+\mathbf{q}}|% \nabla_{u}v^{\text{KS}}\cdot\mathbf{e}_{\nu}|\psi_{n\mathbf{k}}\rangle.

(80)

Here $m_{0}$ is the sum of the masses of all the atoms in the unit cell, $\nabla_{u}$ denotes the gradient with respect to atomic displacements and $\mathbf{e}_{\nu}$ projects the gradient onto the direction of the phonon displacements. In the case of the three translational modes at $|\mathbf{q}|=0$ , the matrix elements $g_{mn}^{\nu}$ vanishes as a consequence of the acoustic sum rule [231].

In GPAW, $\nabla_{u}v^{\text{KS}}(\mathbf{r})$ is determined using a finite-difference method with a super cell description of the system. This step can be performed in any of the wave function representations available in GPAW. The derivative is then projected onto a set of atomic orbitals from an LCAO basis $\phi_{NM}$ , where $N$ denotes the cell index and $M$ the orbital index:

\displaystyle\mathbf{g}_{\begin{subarray}{c}NM\\ N^{\prime}M^{\prime}\end{subarray}}^{sc}=FT\left[\langle\phi_{NM}(\mathbf{k})|% \nabla_{u}v^{\text{KS}}|\phi_{N^{\prime}M^{\prime}}(\mathbf{k})\rangle\right].

(81)

The Fourier transform from the Bloch to the real space representation makes it possible to compute $M_{mn}^{\nu}$ for arbitrary $\mathbf{q}$ . Finally, the electron–phonon coupling matrix is obtained by projecting the matrix corresponding to the supercell into the primitive unit cell bands $m,n$ and phonon modes $\nu$ :

M_{mn}^{\nu}(\mathbf{k},\mathbf{q})=\sum_{\begin{subarray}{c}NM\\ N^{\prime}M^{\prime}\end{subarray}}C_{mM}^{*}C_{nM^{\prime}}\mathbf{g}_{\begin% {subarray}{c}NM\\ N^{\prime}M^{\prime}\end{subarray}}^{sc}\cdot\mathbf{u}_{q\nu}e^{i[(\mathbf{k}% +\mathbf{q})\cdot\mathbf{R}_{N}-\mathbf{k}\cdot\mathbf{R}_{N}^{\prime}]},

(82)

where $C_{nM}$ are the LCAO coefficients and $\mathbf{u}_{q\nu}$ are mass-scaled phonon displacement vectors.

VI.8 Raman spectrum

The Raman effect describes inelastic light scattering, where vibrational modes are excited within the material. Resonant and non-resonant Raman spectra of finite systems such as molecules can be calculated in various approximations [232] using the corresponding interfaces in ASE. The Stokes Raman intensity is then written as

I(\omega)=I_{0}\sum_{\nu}\frac{n_{\nu}+1}{\omega_{\nu}}|\sum_{\alpha,\beta}u_{% \mathrm{in}}^{\alpha}R_{\alpha\beta}^{\nu}u_{\mathrm{out}}^{\beta}|^{2}\delta(% \omega-\omega_{\nu}),

(83)

where $\nu$ denotes phonon mode at $\mathbf{q}=0$ with frequency of $\omega_{\nu}$ and $n_{\nu}$ is the corresponding Bose-Einstein distribution. Furthermore, $u_{\mathrm{in}}^{\alpha}$ and $u_{\mathrm{out}}^{\beta}$ are the polarization vectors of the incoming and outgoing light, and $R_{\alpha\beta}^{\nu}$ denotes the Raman tensor for phonon mode $\nu$ .

The predominant approach for calculating $R_{\alpha\beta}^{\nu}$ involves the use of the Kramers-Heisenberg-Dirac (KHD) method. Within the KHD framework, the Raman tensor is determined by taking the derivative (utilizing a finite-difference method) of the electric polarizability concerning the vibrational normal modes. Alternatively, one can employ time-dependent third-order perturbation theory to compute Raman tensors. These two approaches are equivalent when local field effect are negligible [56]. However, each approach comes with its own set of advantages and drawbacks. The KHD method is computationally more efficient but is limited to computing first-order Raman processes. The perturbative approach can be extended to higher-order Raman processes involving multiple phonons, but it is more computationally demanding, necessitating a greater number of bands and a finer k-mesh grid to achieve convergence. The perturbative approach has been implemented in GPAW and is elaborated below while the KHD method has been implemented in the ASR package [22], utilizing GPAW as the computational backend.

In the perturbative approach, the Raman tensor $R_{\alpha\beta}^{\nu}$ is given by [233, 56]

	$\displaystyle R_{\alpha\beta}^{\nu}\equiv\sum_{ijmn\mathbf{k}}\left[\frac{p_{% ij}^{\alpha}(M_{jm}^{\nu}\delta_{in}-M_{ni}^{\nu}\delta_{jm})p_{mn}^{\beta}}{(% \hbar\omega_{\mathrm{in}}-\varepsilon_{ji})(\hbar\omega_{\mathrm{out}}-% \varepsilon_{mn})}\right.$		(84)
	$\displaystyle+\frac{p_{ij}^{\alpha}(p_{jm}^{\beta}\delta_{in}-p_{ni}^{\beta}% \delta_{jm})M_{mn}^{\nu}}{(\hbar\omega_{\mathrm{in}}-\varepsilon_{ji})(\hbar% \omega_{\nu}-\varepsilon_{mn})}$
	$\displaystyle+\frac{p_{ij}^{\beta}(M_{jm}^{\nu}\delta_{in}-M_{ni}^{\nu}\delta_% {jm})p_{mn}^{\alpha}}{(-\hbar\omega_{\mathrm{out}}-\varepsilon_{ji})(-\hbar% \omega_{\mathrm{in}}-\varepsilon_{mn})}$
	$\displaystyle+\frac{p_{ij}^{\beta}(p_{jm}^{\alpha}\delta_{in}-p_{ni}^{\alpha}% \delta_{jm})M_{mn}^{\nu}}{(-\hbar\omega_{\mathrm{out}}-\varepsilon_{ji})(\hbar% \omega_{\nu}-\varepsilon_{mn})}$
	$\displaystyle+\frac{M_{ij}^{\nu}(p_{jm}^{\alpha}\delta_{in}-p_{ni}^{\alpha}% \delta_{jm})p_{mn}^{\beta}}{(-\hbar\omega_{\nu}-\varepsilon_{ji})(\hbar\omega_% {\mathrm{out}}-\varepsilon_{mn})}$
	$\displaystyle+\left.\frac{M_{ij}^{\nu}(p_{jm}^{\beta}\delta_{in}-p_{ni}^{\beta% }\delta_{jm})p_{mn}^{\alpha}}{(-\hbar\omega_{\nu}-\varepsilon_{ji})(-\hbar% \omega_{\mathrm{in}}-\varepsilon_{mn})}\right]$
	$\displaystyle\times f_{i}(1-f_{j})f_{n}(1-f_{m}),$

where the first term is referred to as the resonant part and the remaining terms represent different time orderings of the interaction in terms of Feynman diagrams. $p_{nm}^{\alpha}=\langle\psi_{n\mathbf{k}}|\hat{p}^{\alpha}|\psi_{m\mathbf{k}}\rangle$ is the momentum matrix element between electronic bands $m$ and $n$ , with transition energy $\varepsilon_{nm}=E_{n}-E_{m}$ in polarization direction $\alpha$ and $M_{nm}^{\nu}$ is the electron–phonon coupling strength in the optical limit $|\mathbf{q}|=0$ as defined in Eq. (82).

Fig. 11 shows the polarization-resolved Raman spectrum of bulk MoS₂ in the 2H phase as computed with a laser frequency of $\omega_{\mathrm{in}}=488$ nm. This example uses only the resonant term, as the other contributions are small in this case. It’s worth noting that we have conducted a comparison of the calculated spectra using the ASR package and observed a high level of agreement for several materials, for example MoS₂. The results obtained from both methods closely align with each other in terms of peak positions and dominant peaks and minor disagreements between the two methods can be attributed to differences in implementation details and the distinct approximations employed by each. Specifically, within the ASR package, we utilized the phonopy package [234] to compute phonon frequencies and eigenvectors, whereas in GPAW, we directly computed phonon frequencies and eigenvectors using ASE’s phonon module. Furthermore, in the ASR implementation, we rigorously enforced the symmetry of the polarizability tensor and the ASR results therefore typically exhibit a more accurate adherence to the required symmetry of the Raman tensor compared to the GPAW implementation.

VI.9 Quadratic optical response functions

The nonlinear optical response of materials can be obtained by going beyond first order perturbation theory. Presently, the GPAW implementation is restricted to second-order response within the dipole approximation and without inclusion of local field effects. We apply the independent particle approximation, which cannot capture collective behavior such as excitonic effects [57]. A spatially homogeneous incident electric field can be written in terms of its Fourier components as

\bm{\mathcal{E}}(t)=\sum_{\alpha,\omega_{1}}\mathcal{E}_{\alpha}(\omega_{1})% \mathbf{e}_{\alpha}e^{-i\omega_{1}t}\,,

(85)

where $\omega_{1}$ runs over positive and negative frequencies, $\mathbf{e}_{\alpha}$ denotes the unit vector along the $\alpha$ -direction, and $\mathcal{E}_{\alpha}(\omega_{1})$ is the electric field at frequency $\omega_{1}$ . The induced quadratic polarization density $\mathcal{P}_{\gamma}^{(2)}(t)$ can then be expressed as

	$\displaystyle\mathcal{P}_{\gamma}^{(2)}(t)=$	$\displaystyle\epsilon_{0}\sum_{\omega_{1}\omega_{2}}\sum_{\alpha\beta}\chi_{% \gamma\alpha\beta}^{(2)}(\omega_{1},\omega_{2})$
		$\displaystyle\qquad\qquad\times\mathcal{E}_{\alpha}(\omega_{1})\mathcal{E}_{% \beta}(\omega_{2})e^{-i(\omega_{1}+\omega_{2})t}\,,$		(86)

where $\chi_{\gamma\alpha\beta}^{(2)}$ is the rank-3 quadratic susceptibility tensor. Due to intrinsic permutation symmetry, i.e. $\chi_{\gamma\alpha\beta}^{(2)}(\omega_{1},\omega_{2})=\chi_{\gamma\beta\alpha}% ^{(2)}(\omega_{2},\omega_{1})$ , $\chi_{\gamma\alpha\beta}^{(2)}$ has at most 18 independent elements which may be further reduced by the Neumann principle and point group symmetries [235]. We note that the corresponding quadratic conductivity tensor is readily derived from $\chi_{\gamma\alpha\beta}^{(2)}$ as $\sigma_{\gamma\alpha\beta}^{(2)}(\omega_{1},\omega_{2})=-i\epsilon_{0}\chi_{% \gamma\alpha\beta}^{(2)}(\omega_{1},\omega_{2})(\omega_{1}+\omega_{2})$ due to relationship between current density and polarization density [236].

Among the various response functions that can be calculated from $\chi_{\gamma\alpha\beta}^{(2)}(\omega_{1},\omega_{2})$ , we have implemented second-harmonics generation (SHG) and the shift current tensor. The implementation currently requires time-reversal symmetry, which limits the application to non-magnetic systems. For SHG, the susceptibility tensor is separated into a pure interband term $\chi_{\gamma\alpha\beta}^{(2e)}$ and a mixed term $\chi_{\gamma\alpha\beta}^{(2i)}$ that read

		$\displaystyle\chi_{\gamma\alpha\beta}^{(2e)}\equiv C_{0}\sideset{}{{}^{\prime}% }{\sum}_{\mathbf{k},nml}\dfrac{\mathrm{Re}\{r_{nm}^{\gamma}(r_{ml}^{\alpha}r_{% ln}^{\beta}+r_{ml}^{\beta}r_{ln}^{\alpha})\}}{2(\varepsilon_{ln}-\varepsilon_{% ml})}$
		$\displaystyle\qquad\times\bigg{(}\frac{2f_{nm}}{2\hbar\omega-\varepsilon_{mn}}% -\frac{f_{nl}}{\hbar\omega-\varepsilon_{ln}}+\frac{f_{ml}}{\hbar\omega-% \varepsilon_{ml}}\bigg{)}\,,$		(87)
		$\displaystyle\chi_{\gamma\alpha\beta}^{(2i)}\equiv iC_{0}\sideset{}{{}^{\prime% }}{\sum}_{\mathbf{k},nm}f_{nm}\Bigg{[}\frac{2\mathrm{Im}\{r_{nm}^{\gamma}(r_{% mn;\beta}^{\alpha}+r_{mn;\alpha}^{\beta})\}}{\varepsilon_{mn}(2\hbar\omega-% \varepsilon_{mn})}$
		$\displaystyle\qquad+\frac{\mathrm{Im}\{r_{mn}^{\alpha}r_{nm;\beta}^{\gamma}+r_% {mn}^{\beta}r_{nm;\alpha}^{\gamma}\}}{\varepsilon_{mn}(\hbar\omega-\varepsilon% _{mn})}$
		$\displaystyle\qquad+\frac{\mathrm{Im}\{r_{nm}^{\gamma}(r_{mn}^{\alpha}\Delta_{% mn}^{\beta}+r_{mn}^{\beta}\Delta_{mn}^{\alpha})\}}{\varepsilon_{mn}^{2}}$
		$\displaystyle\qquad\qquad\qquad\times\bigg{(}\frac{1}{\hbar\omega-\varepsilon_% {mn}}-\frac{4}{2\hbar\omega-\varepsilon_{mn}}\bigg{)}$
		$\displaystyle\qquad-\frac{\mathrm{Im}\{r_{mn}^{\alpha}r_{nm;\gamma}^{\beta}+r_% {mn}^{\beta}r_{nm;\gamma}^{\alpha}\}}{2\varepsilon_{mn}(\hbar\omega-% \varepsilon_{mn})}\Bigg{]}\,.$		(88)

Here, $C_{0}\equiv e^{3}/(2\epsilon_{0}V)$ where $V$ is the crystal volume, $\Delta_{mn}^{\alpha}\equiv(p_{mm}^{\alpha}-p_{nn}^{\alpha})/m$ denotes the velocity difference between bands $n$ and $m$ , and $r_{nm}^{\alpha}$ is the interband ( $n\neq m$ ) position matrix element, obtained from $imr_{nm}^{\alpha}=\hbar p_{nm}^{\alpha}/\varepsilon_{nm}$ . All energies, occupations and matrix elements in the preceding expressions depend on the $\mathbf{k}$ -vector. Also, the summation over $\mathbf{k}$ implies an integral over the first BZ, i.e. $(2\pi)^{3}\sum_{\mathbf{k}}\rightarrow V\int_{\mathrm{BZ}}\mathrm{d}^{3}{% \mathbf{k}}$ . The primed summation signs indicate omission of terms with two or more identical indices. Finally, the generalized derivative $r_{nm;\alpha}^{\beta}$ (for $n\neq m$ ) are evaluated from the sum rule [237]

	$\displaystyle r_{nm;\alpha}^{\beta}$	$\displaystyle=\hbar\frac{r_{nm}^{\alpha}\Delta_{mn}^{\beta}+r_{nm}^{\beta}% \Delta_{mn}^{\alpha}}{\varepsilon_{nm}}$
		$\displaystyle+\frac{i\hbar}{\varepsilon_{nm}}\sum_{l\neq n,m}(\varepsilon_{lm}% r_{nl}^{\alpha}r_{lm}^{\beta}-\varepsilon_{nl}r_{nl}^{\beta}r_{lm}^{\alpha})\,.$		(89)

Here, infinite sums have been substituted with finite sums over a limited, yet sizable, set of bands. It’s important to emphasize that both sides of the equation are dependent on the $\mathbf{k}$ -vector, and the summation on the right-hand side pertains exclusively to bands. It should be mentioned that another implementation for the quadratic susceptibility tensor in the velocity gauge is also available in the code, but is not documented here. For sufficiently many bands, the results of the two implementations are identical [236].

Regarding the shift current, where a DC current is induced in response to an incident AC field, one needs to compute the quadratic conductivity tensor $\sigma_{\gamma\alpha\beta}^{(2)}$ :

	$\displaystyle\sigma_{\gamma\alpha\beta}^{(2)}$	$\displaystyle=\frac{\pi e^{3}}{\hbar^{2}V}\sideset{}{{}^{\prime}}{\sum}_{% \mathbf{k},nm}f_{nm}\mathrm{Im}\{r_{mn}^{\alpha}r_{nm;\gamma}^{\beta}+r_{mn}^{% \beta}r_{nm;\gamma}^{\beta}\}$
		$\displaystyle\qquad\qquad\times\delta(\hbar\omega-\varepsilon_{mn})\,.$		(90)

In practice, the delta function is replaced by a Lorentzian with a finite broadeining $\eta$ . To avoid numerical instabilities, the implementation of Eqs. (VI.9)-(VI.9) uses a tolerance, such that terms are neglected if the associated energy differences or differences in Fermi levels are smaller than a tolerance. The default value of the tolerance is $10^{-6}$ eV, and $10^{-4}$ for the energy and Fermi level differences, respectively.

VII Real-time TDDFT

The real-time TDDFT (RT-TDDFT) scheme, also known as time-propagation TDDFT, is implemented in FD [34] and LCAO [52] modes. It requires non-periodic boundary conditions, but is not restricted to the linear regime and can be applied to model molecules in strong fields. The metod may combined with hybrid quantum-classical modeling to simulate the dynamical interaction between molecules and plasmon resonances at metal surfaces [238]. The LCAO-RT-TDDFT is the more recent implementation supporting versatile analyses and enabling modeling of large systems thanks to its efficiency [52, 53, 239, 54, 240, 241, 242, 243, 244, 245]. We focus on the capabilities of the LCAO version in this section, but some of the described functionalities are also available in FD mode.

The time-dependent KS equation in the PAW formalism is

\displaystyle i\hat{S}\frac{\mathrm{d}}{\mathrm{d}t}|\tilde{\psi}_{n}(t)% \rangle=\left(\hat{H}[n(\mathbf{r})]+\hat{v}(t)\right)|\tilde{\psi}_{n}(t)\rangle,

(91)

where the Kohn–Sham Hamiltonian $\hat{H}[n(\mathbf{r})]$ is implicitly dependent on time through the time-dependent density and $\hat{v}(t)$ is an explicitly time-dependent external potential. We have additionally assumed that the overlap matrix $\hat{S}$ is independent of time, i.e. there are no ion dynamics.

Starting from the ground state, Eq. (91) is propagated forward numerically. After each step, a new density is computed and $\hat{H}[n(\mathbf{r})]$ updated accordingly. The user can can freely define the external potential and implemented standard potentials include the delta-kick $\hat{v}(t)=\hat{x}\delta(t)$ and a Gaussian pulse $\hat{v}(t)=\hat{x}\exp[-\sigma^{2}(t-t_{0})^{2}/2]\cos[\omega(t-t_{0})]$ , where $\hat{x}$ is the dipole operator in the direction $x$ .

During the propagation, different time-dependent variables can be recorded, and after the propagation they can be post-processed to quantities of physical and chemical interest. As a basic example, the time-dependent dipole moment recorded for a delta-kick perturbation can be Fourier-transformed to yield the photoabsorption spectrum [246]. Observables are recorded by attaching observers to the calculation and implemented observers include writers for the dipole moment, magnetic moment, KS density matrix in frequency space and wave functions.

RT-TDDFT calculations can be started from the ground state or continued from the last state of a previous time propagation and the time-limiter feature allows one to limit jobs to a predefined amount of wall time. Together with the continuation capability, this facilitates time propagation in short chunks, efficiently using shared high-performance resources.

In the LCAO-RT-TDDFT implementation, Eq. (91) is cast into a matrix equation and solved with ScaLAPACK [52]. The intermediate step of updating the Hartree and XC potential is performed on the real-space grid.

VII.1 Kohn–Sham decomposition

The time-dependent KS density matrix can be written as

\displaystyle\begin{split}\rho_{nn^{\prime}}(t)=\sum_{m}f_{m}\times&\int\left(% \psi^{(0)}_{n}(\mathbf{r})\right)^{*}\psi_{m}(\mathbf{r},t)\mathrm{d}{\mathbf{% r}}\\ \times&\int\psi^{*}_{m}(\mathbf{r}^{\prime},t)\psi^{(0)}_{n^{\prime}}(\mathbf{% r^{\prime}})\mathrm{d}{\mathbf{r}^{\prime}},\end{split}

(92)

where $\psi^{*}_{m}(\mathbf{r}^{\prime},t)$ are the time-dependent KS orbitals with ground state occupation factors $f_{m}$ . The KS density matrix is a central quantity enabling computation of observables and may be evaluated efficiently in the LCAO mode [54].

The Fourier transform of the induced density matrix $\delta\rho_{nn^{\prime}}(\omega)=\mathcal{F}[\rho_{nn^{\prime}}(t)-\rho_{nn^{% \prime}}(0)]$ can be built on the fly during time propagation through the density-matrix observer. Details on the implementation are described in Ref. [54]. The KS density matrix in frequency space is related to the Casida eigenvectors and gives similar information as the solution of the Casida equation [54]. Observables such as the polarizability can be decomposed into a sum over the electron–hole part of the KS density matrix $\rho_{nn}$ , where $f_{n}>f_{n^{\prime}}$ . This enables illustrative analyses, e.g., by visualizing $\rho_{nn^{\prime}}(\omega)$ on energy axes as a transition contribution map [247], from which the nature of the localized surface plasmon resonance can be understood (Fig. 12, see Ref. [54] for detailed discussion).

VII.2 Hot-carrier analysis

The KS density matrix is a practical starting point for analyzing hot-carrier generation in plasmonic nanostructures. In the regime of weak perturbations, the KS density matrix resulting from arbitrary pulses can be obtained from delta-kick calculations by a post-processing routine [243, 244]. By decomposing the matrix into different spatial and energetic contributions, hot-carrier generation in nanostructures [243, 245] and across nanoparticle-semiconductor [241] and nanoparticle-molecule [240, 244] interfaces can be studied. Computational codes and workflows for such hot-carrier analyses are provided in Refs. [248, 249].

VII.3 Circular dichroism for molecules

Electronic circular dichroism (ECD) is a powerful spectroscopic method for investigating chiral properties at the molecular level. The quantity that characterizes the ECD is the rotatory strength, which is defined through the magnetic dipole moment $m(\omega)$ as

R(\omega)=\frac{1}{\pi\kappa}\mathrm{Re}\bigg{[}\sum_{\alpha}m_{\alpha}^{(% \alpha)}(\omega)\bigg{]},

(93)

where the index $\alpha$ enumerates Cartesian directions, the $\alpha$ superscript in parenthesis indicates the $\delta$ -kick direction and $\kappa$ is the strength of the kick. To resolve $R(\omega)$ , one needs to perform the $\delta$ -kick in all three Cartesian directions using a perturbing electric field $\mathbf{E}^{\alpha}(t)=\kappa\hat{\mathbf{\alpha}}\delta(t)$ . The frequency components of the dipole moment $m_{i}^{(i)}(\omega)$ are calculated by Fourier transforming $m_{\alpha}^{(\alpha)}(t)$ , which is recorded during the propagation. Finally, the time-dependent magnetic dipole moment is obtained as the expectation value of the operator $\hat{\mathbf{m}}=-\frac{i}{2c}\hat{\mathbf{r}}\times\hat{\nabla}$ ,

\mathbf{m}(t)=\sum_{n}f_{n}\int\psi_{n}^{*}({\mathbf{r}},t)\,\hat{\mathbf{m}}% \,\psi_{n}({\mathbf{r}},t)~{}{\rm d}\mathbf{r},

(94)

where $f_{n}$ is the occupation number of KS orbital $n$ and $\psi_{n}({\bf r},t)$ is the time-evolved KS state. The current GPAW implementation supports both the FD and LCAO mode and the computational efficiency of the LCAO mode enables calculation of the ECD of nanoscale metal-organic nanoclusters. More details on the implementation can be found in Ref. [55].

VII.4 Radiation reaction potential

Plasmonic or collective molecular excitations are strongly susceptible to any kind of optical interaction. Induced currents will couple via Maxwell’s equation of motion to the optical environment and result in radiative losses, i.e., decay towards the ground state. It is possible to solve the Maxwell problem formally by obtaining Green’s tensor $\textbf{G}_{\perp}(\omega)$ . The dipolar interaction between electric field and electronic dipole can be absorbed into a local potential $v_{\text{rr}}(\textbf{r},t)=e\textbf{r}\cdot\textbf{E}_{r,\perp}(t)=e\textbf{r% }\cdot\big{[}\mathcal{F}_{t}^{-1}(i\mu_{0}\omega\textbf{G}_{\perp}(\omega))% \ast\partial_{t}\textbf{R}(t)\big{]}$ suitable for Kohn–Sham TDDFT, where $\textbf{R}(t)$ is the total electronic dipole moment. A detailed discussion can be found in Ref. [250].

For many simple structures, such as free-space, one-dimensional wave-guides, or dielectric spheres, $\textbf{G}_{\perp}(\omega)$ is analytically known and radiative losses can then be included in TDDFT without additional coimputational cost. The tutorials on the GPAW web-page [61] includes an example for one-dimensional wave-guides for which the user can specify the cross sectional area and the polarization of the propagating modes. Extending the functionality of the radiation-reaction potential to 3D free-space and collective interaction of large ensembles in Fabry-Pérot cavities from first principles [251] is essential for the understanding of polaritonic chemistry. This functionality is currently under development.

VII.5 Ehrenfest dynamics

Molecular dynamics (MD) simulations usually rely on the Born-Oppenheimer approximation, where the electronic system is assumed to react so much faster than the ionic system that it reaches its ground state at each time step. Thus, forces for the dynamics are calculated from the DFT ground-state density. While this approximation is sufficiently valid in most situations, there are cases where the explicit dynamics of the electronic system can affect the molecular dynamics, or the movement of the atoms can affect averaged spectral properties. These cases can be handled using so-called Ehrenfest dynamics, ie. time- dependent density functional theory molecular dynamics (TDDFT/MD).

Ehrenfest dynamics is implemented in the FD mode [252]. A description of the theory and a tutorial is available on the GPAW web-page [61]. This functionality has been used to model the electronic stopping of ions including core-electron excitations [253], study charge transfer at hybrid interfaces in the presence of water [254], simulate the coherent diffraction of neutral atom beams from graphene [255], model the dependence of carbon bond breaking under Ar⁺-ion irradiation on $sp$ hybridization [256], and reveal charge-transfer dynamics at electrified sulfur cathodes [257]. An LCAO implementation, inspired by recent work in the Siesta code [258, 259], is currently under development.

VIII Excited-state DFT methods

VIII.1 Improved virtual orbitals

The linear response TDDFT approach generally provides reasonably accurate excitation energies for low-lying valence excited states, where the orbitals associated with the holes and excited electrons overlap significantly. However, it tends to fail for excitations involving spatial rearrangment of the electrons, such as charge transfer [260, 113], Rydberg [261] and doubly excited states [262].

Some of these problems can be alleviated by using range separated functionals (see section III.3.4). However, these functionals come with a significantly increased computational cost due to the evaluation of exchange integrals. Moreover, due to the missing cancellation of Coulomb and exchange terms for canonical unoccupied orbitals within Hartree-Fock theory, one obtains spurious unoccupied orbitals. This leads to slow covergence of linear-response TDDFT calculations with respect to the number of unoccuppied orbitals when hybrid and range-separated functionals are used [122, 123].

Substantial improvement in convergence with respect to unoccupied orbitals can be obtained using improved virtual orbitals as devised by Huzinaga and Arnau [263, 264]. In this approach, a modified Fock operator is used for the unoccupied orbitals, which mimics the interaction between a hole arbitrarily chosen among the occupied ground state orbitals and the excited electron. This leads to faster convergence in linear-response TDDFT calculations with hybrid and range-separated functionals, and also makes it possible to evaluate excited state properties. For example, the energetics of long-range charge transfer can be obtained by means of a ground state calculations because the difference between the energy of an improved virtual orbital and a hole tends to approximate the excitation energy. The improved virtual orbitals approach is available in GPAW and details on the implementation are described in Ref. [123].

VIII.2 Variational excited-state calculations

GPAW also offers the possibility to perform excited-state calculations using an alternative time-independent density functional approach [265] (sometimes referred to as the “ $\Delta$ SCF” method), which does not suffer from the limitations of linear-response TDDFT mentioned in the previous section. The method involves variational optimization of the orbitals corresponding to a specific excited state by optimizing the electronic energy to a stationary point other than the ground state. The computational effort is similar to that of ground-state calculations and the variational optimization guarantees that the Hellmann–Feynman theorem is fulfilled. Therefore, all the ground-state machinery available in GPAW to evaluate atomic forces can be used for geometry optimization and for simulating the dynamics of atoms on the excited state. Furthermore, coupling this time-independent, variational approach for excited-state calculations with external MM potentials (see section IV) does not involve additional implementation efforts compared to ground-state calculations and provides a means for performing excited-state QM/MM molecular dynamics simulations that include the state-specific response of a solvent, i.e. the response due to changes in the electron density of the solute.

Variationally optimized excited states correspond to single Slater determinants of optimal orbitals with a non-aufbau occupation and are typically saddle points on the electronic energy surface. Hence, variational calculations of excited states are prone to collapsing to lower-energy solutions, which preserve the symmetry of the initial guess. A simple maximum overlap method (MOM) [266, 267] is available in GPAW to address this problem. At each SCF step, the MOM occupies those orbitals that overlap the most with the orbitals of a non-aufbau initial guess, usually obtained from a ground-state calculation. The MOM, however, does not guarantee that variational collapse is avoided, and convergence issues are common when using SCF eigensolvers with density-mixing schemes developed for ground-state calculations.

VIII.2.1 Direct orbital optimization

To alleviate the issues leading to variational collapse and achieve more robust convergence to excited-state solutions, GPAW contains two alternative strategies that are more reliable than conventional SCF eigensolvers with the MOM. They are based on direct optimization of the orbitals and use saddle point search algorithms akin to those for transition-state searches on potential energy surfaces of atomic rearrangements. These approaches also facilitate variational excited-state calculations of nonunitary invariant functionals, such as self-interaction corrected functionals (see section III.3.5).

The first of these methods is a direct orbital optimization approach supplemented with the MOM (DO-MOM). This method is an extension of the direct minimization approach using the exponential transformation illustrated in section III.2.5 where the search is for a generic stationary point of $E[\mathbf{\Psi}]$ instead of a minimum:

\underset{\mathbf{\Psi}}{\rm stat\,}E[\mathbf{\Psi}]=\underset{\mathbf{\Psi}_{% 0}}{\rm min\,}\underset{\mathbf{A}}{\rm stat\,}\mathcal{F}[\mathbf{A},{\mathbf% {\Psi}_{0}}]\,.

(95)

The DO-MOM is available in GPAW for both LCAO [51, 268], real-space grid, and plane wave basis sets [50]. For the LCAO basis, the excited-state optimization only necessitates making the energy stationary with respect to the elements of the anti-Hermitian matrix $\mathbf{A}$ (the orbital rotation angles), while calculations using the real-space grid and plane-wave basis include an outer-loop minimization with respect to the reference orbitals ${\mathbf{\Psi}_{0}}$ . The optimization in the linear space of anti-Hermitian matrices uses efficient quasi-Newton algorithms that can handle negative Hessian eigenvalues and therefore converge on saddle points. GPAW implements a novel limited-memory SR1 (L-SR1) algorithm, which has proven to be robust for calculations of excitations in molecules [50, 51].

The DO-MOM relies on estimating the degrees of freedom along which the energy needs to be maximized starting from an initial guess. For valence and Rydberg excitations, the initial guess consisting of ground-state canonical orbitals with non-aufbau occupation numbers and preconditioning with a diagonal approximation of the electronic Hessian using the orbital energies [79] can be sufficient. However, if the excitation involves significant charge transfer, large rearrangements of the energy ordering of the orbitals can occur, and DO-MOM can struggle to converge. A second direct optimization method with generalized mode following (DO-GMF) [49] alleviates these problems and is also implemented. In DO-GMF, the components of the energy gradient $\mathbf{g}$ along the modes $\mathbf{v}_{i}$ corresponding to the $n$ lowest eigenvalues of the electronic Hessian are inverted yielding

\mathbf{g}^{\mathrm{\,mod}}=\mathbf{g}-2\sum_{i=1}^{n}\mathbf{v}_{i}\mathbf{v}% _{i}^{\mathrm{T}}\mathbf{g}\,,

(96)

and a minimization using the modified gradient $\mathbf{g}^{\mathrm{\,mod}}$ is performed by following the $n$ modes simultaneously. This procedure guarantees convergence to an $n^{\rm th}$ -order saddle point, eliminating the risk of variational collapse altogether. While it is computationally more expensive than DO-MOM due to the need for a partial Hessian diagonalization, DO-GMF is more robust. Hence, it is particularly useful in the exploration of potential energy surfaces because it is able to follow an excited state through bond-breaking configurations, where broken-symmetry solutions appear, by targeting the solution that preserves the saddle point order. This important advantage is exemplified by the challenging double-bond twisting in ethylene [49, 133], where DO-GMF calculations of the lowest doubly excited state provide an avoided crossing with the ground state whereas other methods fail to do so.

VIII.2.2 Example applications of direct optimization

The efficiency and robustness of the direct-optimization approaches combined with the possibility of choosing different basis-set types make variational calculations of excitated states in GPAW applicable to a great variety of systems ranging from molecules in gas phase or solution to solids.

State-specific orbital relaxation enables the description of challenging excitations characterized by large density rearrangements. Figure 13 shows the error on the vertical excitation energy of a charge-transfer excitation in the twisted N-phenylpyrrole molecule [49] obtained with direct optimization in GPAW using the LDA, PBE and BLYP functionals and an sz+aug-cc-pVDZ [53] basis set as compared to the results of linear-response TDDFT calculations with the same basis set and functionals, as well as the hybrid functionals PBE0 and B3LYP (results from Ref. [269]). For the variational calculations, the energy of the singlet excited state is computed using the spin-purification formula $E_{\mathrm{s}}=2E_{\mathrm{m}}-E_{\mathrm{t}}$ , where $E_{\mathrm{m}}$ is the energy of a spin-mixed state obtained by promoting an electron in one spin channel and $E_{\mathrm{t}}$ is the energy of the triplet state with the same character. The variational calculations underestimate the theoretical best-estimate value of the excitation energy (5.58 eV) in Ref. [269] by 0.15-0.3 eV, an error that is significantly smaller than that of linear-response TDDFT calculations with the same functionals ( $-2.0$ eV) or with the more computationally intensive PBE0 hybrid functional ( $-0.85$ eV) [269].

The method has also been used to simulate the photoinduced structural changes of photocatalytic metal complexes and concomitant solvation dynamics [270, 165, 182, 272, 181]. Figure 13 shows an application to the prototypical copper complex photosensitizer [Cu(dmphen)₂]⁺ (dmphen=2,9-dimethyl-1,10-phenanthroline) in acetonitrile, where QM/MM molecular dynamics simulations elucidated an intricate interplay between deformation of the ligands and rearrangement of the surrounding solvent molecules following a photoexcitation to a metal-to-ligand charge-transfer state [270, 165].

The last example of an application shown in Figure 13 is a calculation of the excited states of a solid state system [273], the negatively charged nitrogen-vacancy center in diamond, which comprises a prototypical defect for quantum applications. The system is described with a large supercell of up to 511 atoms, and the calculations use a plane-wave basis set. In contrast to previous reports, a range of different density functionals is found to give the correct energy ordering of the excited states, with the r²SCAN functional providing the best agreement with high-level many-body quantum embedding calculations with an error of less than $0.06$ eV [273, 271]. This example shows that the direct optimization methods in GPAW are promising tools for simulating excited states in extended systems, where alternative approaches are either computationally expensive or lack accuracy.

IX Other features

IX.1 Electric polarization

The formal polarization of bulk materials may be calculated from the modern theory of polarization [274, 275] as

\mathbf{P}_{\mathrm{F}}=\mathbf{P}_{\mathrm{F}}^{\mathrm{el}}+\mathbf{P}_{% \mathrm{F}}^{\mathrm{n}},

(97)

where

\mathbf{P}_{\mathrm{F}}^{\mathrm{el}}=\frac{e}{(2\pi)^{3}}\mathrm{Im}\int_{% \mathrm{BZ}}\mathrm{d}\mathbf{k}\sum_{n}f_{\mathbf{k}n}\langle u_{\mathbf{k}n}% |\nabla_{\mathbf{k}}|u_{\mathbf{k}n}\rangle

(98)

is the electronic contribution and

\mathbf{P}_{\mathrm{F}}^{\mathrm{n}}=\frac{1}{V_{\mathrm{cell}}}\sum_{a}Z^{a}% \mathbf{r}^{a}

(99)

is the contribution from the nuclei. Here the sums run over atoms in the unit cell and $Z^{a}$ is the charge of nucleus $a$ (including core electrons), situated at position $\mathbf{r}^{a}$ . The electronic contribution can be viewed as a Brillouin-zone integral of $\mathbf{k}$ -space Berry phases and may be evaluated from a finite-difference version of Eq. (98) [276]. This involves the overlaps between Bloch orbitals at neighbouring $\mathbf{k}$ -points, which are straightforward to evaluate in the PAW formalism [46].

Eq. (97) is only defined modulo $e\mathbf{R_{i}}/V_{\mathrm{cell}}$ , which follows from the arbitrary choice of unit cell for the atomic positions as well as the choice of phases for $u_{\mathbf{k}n}$ , which can shift the Berry phase by $2\pi$ . The change in polarization under any adiabatic deformation is, however, well defined and may be calculated as

\Delta\mathbf{P}=\int_{0}^{\lambda}\mathrm{d}\lambda\frac{\mathrm{d}\mathbf{P}% _{\mathrm{F}}}{\mathrm{d}\lambda},

(100)

where $\lambda$ is some dimensionless variable parameterizing the adiabatic path. In particular, for ferroelectrics the spontaneous polarization $\mathbf{P}_{\mathrm{S}}$ can be evaluated by choosing a path that deforms the structure from the polar ground state at $\lambda=1$ to a non-polar structure at $\lambda=0$ .

In Fig. 14, we show an example of this for tetragonal KNbO₃, which is a well-known ferroelectric [277]. The polar structure was relaxed under the constraint of tetragonal symmetry using PBE ( $\lambda=1$ ) and then linearly interpolated to the inverted structure ( $\lambda=-1$ ) passing through a centrosymmetric point at $\lambda=0$ . There are infinitely many polarization branches differing by the polarization quantum $ec/V_{\mathrm{cell}}$ ( $c$ being the lattice constant in the $z$ -direction) and the spontaneous polarization is obtained by choosing a single branch and evaluating the difference in formal polarization at $\lambda=1$ and $\lambda=0$ . Interestingly, the centrosymmetric point has a non-vanishing polarization given by half the polarization quantum. This is allowed due to the multi-valued nature of the formal polarization and such a “topological polarization” in non-polar materials has been shown to yield gapless surface states [278, 279]. Here, however, we merely use the topological polarization to emphasize the importance of evaluating the spontaneous polarization as the change in $\mathbf{P}_{\mathrm{F}}$ along an adiabatic path.

The expressions (97-100) can be applied to extract various properties of non-polar materials as well, for example the Born effective charge tensors

Z^{a}_{\alpha\beta}=V_{\mathrm{cell}}\frac{\partial P_{\beta}}{\partial r_{% \alpha}^{a}},

(101)

which yield the change in polarization resulting from small shifts in atomic positions. In GPAW, these are obtained by a simple call to a module that introduces a (user-defined) shift of all atoms in the unit cell and calculates the resulting change in polarization from Eq. (97-99). The Born charges may be combined with the atomic force matrix to calculate equilibrium positions under an applied static electric field and the lattice contribution to the dynamic polarizability can be calculated from the eigenvalues and eigenvectors of the force matrix [280]. In addition, the piezoelectric response can be obtained by calculating the change in polarization in response to external strain [45]. The lattice contribution to the polarizability is typically orders of magnitude smaller than the electronic part, but for ferroelectrics the soft phonon modes associated with spontaneous polarization can give rise to significant lattice polarizabilities.

We exemplify this by the well known case of 2D ferroelectric GeS [281, 282, 283] where we obtain a spontaneous polarization of 490 pC/m [284, 44] in excellent agreement with previous calculations [283]. The lattice and electronic 2D in-plane polarizabilities are $\alpha^{\mathrm{2D}}_{\mathrm{lat}}=4.32$ Å and $\alpha^{\mathrm{2D}}_{\mathrm{el}}=3.75$ Å respectively [45]. For comparison, the non-polar case of 2D MoS₂ yields $\alpha^{\mathrm{2D}}_{\mathrm{lat}}=0.09$ Å and $\alpha^{\mathrm{2D}}_{\mathrm{el}}=6.19$ Å [45].

IX.2 Berry phases and band topology

Topological phases such as the quantum spin Hall state and Chern insulator depend crucially on presence of spin–orbit coupling and the band topology may be obtained from the evolution of $\mathbf{k}$ -space Berry phases across the Brillouin zone. In particular, for insulators the eigenvalues of the Berry phase matrix of occupied states

\gamma_{mn}^{i}(\mathbf{k}_{\perp})=i\int_{0}^{2\pi}\mathrm{d}k_{i}\langle u_{% m}(\mathbf{k})|\partial_{k_{i}}|u_{n}(\mathbf{k})\rangle

(102)

must change by an integer multiple of $2\pi$ when a component of $\mathbf{k}_{\perp}$ (components of $\mathbf{k}$ orthogonal to $k_{i}$ ) is cycled through the Brillouin zone [285]. Here, $u_{n}(\mathbf{k})$ are spinor Bloch states, which are typically not smooth functions of $\mathbf{k}$ . The evaluation of Eq. (102) thus requires the construction of a smooth gauge and in GPAW this is handled by the parallel-transport algorithm of Ref. [286].

The method has been applied to high throughput search for new topological two-dimensional materials [46] and in Fig. 15 we show the calculated Berry phases of 1T’-MoS₂ [287]. Due to time-reversal symmetry, the Berry phases at $\Gamma$ and $Y$ are two-fold degenerate, and therefore no perturbation that conserves time-reversal symmetry can open a gap in the Berry-phase spectrum. This property is diagnostic for the quantum spin Hall insulating state and closely related to the presence of gapless edge states [285].

The eigenvalues of Eq. (102) may also be used to calculate the electronic contribution to the formal polarization, since the sum of all individual Berry phases yields the same value as one may obtain from Eq. (98). The present approach is, however, more involved since it requires the construction of a smooth gauge, which is not needed in Eq. (98).

IX.3 Wannier functions

Wannier functions (WFs) provide a localized representation of the electronic states of a solid. The WFs are defined by a unitary transformation of the Bloch eigenstates that minimises the spatial extent of the resultant orbitals. Specifically, the $n$ th Wannier function in unit cell $i$ is written as

w_{in}(\mathbf{r})=\frac{1}{N_{\mathbf{k}}}\sum_{\mathbf{k}}e^{-i\mathbf{k}% \cdot\mathbf{R}_{i}}\tilde{\psi}_{n\mathbf{k}}(\mathbf{r}),

(103)

where $\tilde{\psi}_{n\mathbf{k}}$ is a generalized Bloch function (a superposition of Bloch eigenstates at $\mathbf{k}$ ).

The minimization of the spatial extent of a set of WFs $\{w_{n}(\mathbf{r})\}_{n=1}^{N_{w}}$ is equivalent to the maximization of the spread functional [288]

\Omega=\sum_{n=1}^{N_{w}}\sum_{\alpha=1}^{N_{q}}W_{\alpha}|Z_{\alpha,nn}|^{2},

(104)

where

Z_{\alpha,nn}=\langle w_{n}|e^{-i\mathbf{q}_{\alpha}\cdot\mathbf{r}}|w_{n}\rangle.

(105)

The $\{\mathbf{q}_{\alpha}\}$ is a set of at most 6 reciprocal vectors connecting a $\mathbf{k}$ -point to its neighbors and $W_{\alpha}$ are corresponding weights accounting for the shape of the unit cell [289].

The generalized Bloch functions of Eq. (103) are determined by minimizing $\Omega$ using e.g. a conjugate gradient scheme as implemented in the ASE Wannier module. The inputs to this Wannierization algorithm are the matrices

$\displaystyle Z_{\alpha,ij}^{(0),\mathbf{k}}$	$\displaystyle=$	$\displaystyle\langle\tilde{\psi}_{\mathbf{k}i}\|e^{-i\mathbf{q}_{\alpha}\cdot% \mathbf{r}}\|\tilde{\psi}_{\mathbf{k}+\mathbf{q}_{\alpha},j}\rangle$	(106)
	$\displaystyle+$	$\displaystyle\sum_{aii^{\prime}}e^{-i\mathbf{q}_{\alpha}\cdot\mathbf{R}^{a}}% \Delta S_{ii^{\prime}}^{a}$
		$\displaystyle\times\langle\tilde{\psi}_{\mathbf{k}i}\|\tilde{p}_{i}^{a}\rangle% \langle\tilde{p}_{i^{\prime}}^{a}\|\tilde{\psi}_{\mathbf{k}+\mathbf{q}_{\alpha}% ,j}\rangle,$

where $\Delta S_{ii^{\prime}}^{a}$ are the PAW corrections from Eq. 8. From these matrices, the ASE Wannier module can be used to construct partially occupied Wannier functions [290, 291], which are a generalization of maximally-localized Wannier functions [286] to entangled bands and non-periodic systems. Recently, a further improvement in terms of robustness of the Wannierisation procedure was achieved using a modified spread functional containing a penalty term proportional to the variance of the spread distribution of the WFs, which leads to a more uniform spread distribution [292].

IX.4 Point defect calculations with hybrid functionals

Point defects play a crucial role in many applications of semiconductors[293, 294]. First-principles calculations can be used to determine the atomic structure, formation energy, and charge-transition levels of points defects. It is well established that the best description of point defects in semiconductors/insulators is obtained using range separated hybrids, such as the HSE06 xc-functional[5, 295]. To illustrate the use of GPAW for point defect calculations, we determine the formation energy diagrams of the C_N and C_B defects in the hexagonal boron nitride (hBN) crystal with the HSE06 functional. These defects have been proposed to be responsible for the deep-level luminescence signal with a zero-phonon line (ZPL) around 4.1 eV.[296, 295]. The results are compared to similar results obtained with the VASP software package.

For a point-defect $\mathrm{D}$ in charge state $q$ , the formation energy E^f is calculated from the formula,

E^{\mathrm{f}}\left[\mathrm{D}^{q}\right]=E_{\mathrm{tot}}\left[\mathrm{D}^{q}% \right]-E_{\mathrm{tot}}^{\mathrm{bulk}}-\sum_{i}n_{i}\mu_{i}+qE_{\mathrm{F}}+% E_{\mathrm{corr}}.

(107)

Here $E_{\mathrm{tot}}\left[\mathrm{D}^{q}\right]$ and $E_{\mathrm{tot}}^{\mathrm{bulk}}$ are the total energies of the crystal with the point defect in charge state $q$ and of the neutral pristine crystal, respectively. $\mu_{i}$ is the chemical potential of the element $i$ while $n_{i}$ is the number of atoms added to ( $n_{i}>0$ ) or removed from ( $n_{i}<0$ ) the crystal to create the defect. $E_{\mathrm{F}}$ is the chemical potential of the electrons, i.e. the Fermi level, which is written as $E_{\mathrm{F}}=E_{\mathrm{VBM}}+\Delta E_{\mathrm{F}}$ , where VBM is the valence band maximum. Finally, E_corr is a correction term which accounts for: (i) the spurious electrostatic interaction between the periodic images of the defect and their interaction with the compensating homogeneous background charge and (ii) the potential shift between the pristine and defect system.

For more details on the methodology of point defect calculations we refer the reader to the excellent review papers on the topic [297, 298, 299, 300].

All calculations have been performed using the HSE06 functional with the default mixing parameter $\alpha$ =0.25, plane wave cut-off of 800 eV, and forces converged to 0.01 eV/Å. The lattice of the hBN crystal was fixed at the experimental parameters ( $a=2.50$ Å and $c=6.64$ Å) [301]. The band gap of the pristine crystal was determined to be 5.58 eV (using $8\times 8\times 4$ $k$ -points) in good agreement with the experimental band gap of 6.08 eV [302]. The structure of the point defects were relaxed in a $4\times 4\times 2$ (128 atom) supercell using $\Gamma$ -point $k$ -point sampling. For each defect three different charge states ( $q=1,0,-1$ ) were considered. The corrections ( $E_{\mathrm{corr}}$ ) due to image charges and potential alignment are evaluated following Freysoldt, Neugebauer and Van de Walle [303] as implemented in GPAW.

Figure 16 shows the defect formation energies as a function of Fermi level for C_N and C_B, at N-rich and N-poor conditions, respectively. We can see that under N-rich conditions, C_B is energetically lower, whereas C_N is favorable under N-poor conditions. C_B shows a 1/0 charge transition at 3.73 eV above the VBM, whereas C_N has a 0/-1 charge transition 3.26 eV deep inside the band gap. We find good agreement with a similar study [295] also employing plane waves and the HSE06 functional (VASP calculations). Minor discrepancies can be attributed to use of a different supercell size and slightly higher fraction of the non-local mixing parameter ( $\alpha$ =0.31).

IX.5 Point-group symmetry representations

GPAW allows for the automatized assignment of point-group symmetry representations for the pre-computed Kohn–Sham wavefunctions. This can be used for determining the wave-function symmetry representations for both molecules [304] and extended structures [305] to analyze, for example, the symmetry-involved degeneracy of the bands and selection rules for dipole transitions.

The analysis follows directly from group theory [306], stating that the solutions to the Schrödinger equation inherit the symmetry group of the respective Hamiltonian, or essentially the external potential invoked by the atomic configuration. The representation matrices $\Gamma$ are computed as

\Gamma_{n}(T)=\int\phi_{n}^{\dagger}(\mathbf{r})P(T)\phi_{n}(\mathbf{r})% \mathrm{d}\mathbf{r},

(108)

where $\phi$ is a normalized wavefunction, $n$ is the eigenstate label, and $P(T)$ is an operation that corresponds to the transformation $T$ of the symmetry group of the Hamiltonian. The operations $P(T)$ include rotations that are non-trivial for the rectangular grid that the computed wavefunctions are projected onto. The wavefunction rotations are performed on the grid by cubic interpolation. The output of the analysis contains the irreducible representation weights $c_{\alpha,n}$ for each eigenstate $n$ as solved from

\Gamma_{n}(T)=\sum_{\alpha}c_{\alpha,n}\chi_{\alpha}(T),

(109)

where $\chi_{\alpha}$ are the character vectors of the group (i.e. rows of the character table).

When doing the analysis, the user needs to input the coordinates of the center of symmetry (typically the coordinates of a single atom), and the point group for which the analysis is run. It is possible to analyze only a part of the wave function by selecting a cutoff radius from the center of symmetry beyond which the parts of the wave function are neglected. This enables the investigation of the purity of the local symmetry even if the symmetry of the Hamiltonian is broken far from the center of symmetry [304, 305]. To date, point groups of C₂, C_2v, C_3v, D_2d, D_3h, D₅, D_5h, I, I_h, O_h, T_d, and T_h are implemented.

IX.6 Band-structure unfolding

When studying defect formation, charge-ordered phases or structural phase transitions, it is often needed to perform DFT calculations on a super-cell. A super-cell (SC) calculation comes with the cost of having to account for many more electrons in the unit cell when compared to the primitive cell (PC). This implies that besides the increased computational effort, the band structure of a SC contains more bands in a smaller Brillouin zone as compared to the PC. In order to compare electronic band structures between SC and PC, unfolding the band structure of the SC into the one of the primitive cell (PC) becomes convenient.

GPAW features the possibility of performing band-structure unfolding in the real-space grid, plane wave, and LCAO modes. The implementation allows to unfold the SC band structure without the explicit calculation of the overlap between SC and PC wavefunctions, following the procedure described in Ref. [307]. The unfolded band structure is given in terms of the spectral function

A(\mathbf{k},\epsilon)=\sum_{m}P_{\mathbf{K}m}(\mathbf{k})\delta(\epsilon_{% \mathbf{K}m}-\epsilon),

(110)

with $\mathbf{k}$ , $\mathbf{K}$ momenta in the PC and SC Brillouin zone respectively, $\epsilon_{\mathbf{K}m}$ the SC eigenvalues obtained for momentum $\mathbf{K}$ and band index $m$ , and $P_{\mathbf{K}m}(\mathbf{k})$ is calculated as

P_{\mathbf{K}m}(\mathbf{k})=\sum_{\{\mathbf{G}\}}|C_{\mathbf{K}m}(\mathbf{G}+% \mathbf{k}-\mathbf{K})|^{2},

(111)

where $C_{\mathbf{K}m}$ are the Fourier coefficients of the eigenstate $|\mathbf{K}m\rangle$ and $\{\mathbf{G}\}$ the subset of reciprocal space vectors of the SC that match the reciprocal space vectors of the PC. A more detailed explanation and technical details on how to perform a band-structure unfolding can be found on the GPAW web-page [61].

IX.7 The QEH model

The quantum-electrostatic heterostructure (QEH) [308] model is an add-on GPAW feature for calculating the dielectric response and excitations in vertical stacks of 2D materials, also known as van der Waals (vdW) heterostructures. The QEH model can be used independently from the GPAW code, but it relies on the GPAW implementation for the calculation of the fundamental building blocks used by the model, as elaborated below.

The dielectric screening in 2D materials is particularly sensitive to changes in the environment and depends on the stacking order and thickness of the 2D heterostructure, providing a means to tune the electronic excitations, including quasi-particle band gaps and excitons. While the dielectric response of freestanding layers can be explicitly represented ab initio in GPAW at the linear-response TDDFT, GW, and BSE level of theory, lattice mismatch between different 2D layers often results in large supercells that make these many-body approaches infeasible. Since the interaction between stacked layers is generally governed by van der Waals interactions, the main modification to the non-interacting layers’ dielectric response arises from the long-range electrostatic coupling between layers.

Therefore, in the QEH model, the dielectric function of the vdW heterostructure is obtained through an electrostatic coupling of the quantum dielectric building blocks of individual 2D layers [309]. The dielectric building blocks consist of monopole and dipole components of the density-response function of the freestanding layers, $\tilde{\chi_{i}}(\mathbf{q}_{\parallel},\omega)$ , calculated from ab initio on the RPA level. Subsequently, the full density-response function $\chi_{i,j}(\mathbf{q}_{\parallel},\omega)$ (density perturbation on layer $i$ due to a monopole or dipole component of the perturbing field acting on layer $j$ ) is calculated by solving the Dyson equation

	$\displaystyle\chi_{i,j}(\mathbf{q}_{\parallel},\omega)$	$\displaystyle=\tilde{\chi_{i}}(\mathbf{q}_{\parallel},\omega)\delta_{i,j}+$
		$\displaystyle\tilde{\chi_{i}}(\mathbf{q}_{\parallel},\omega)\sum_{k\neq i}V_{i% ,k}(\mathbf{q}_{\parallel})\chi_{k,j}(\mathbf{q}_{\parallel},\omega),$		(112)

where the Coulomb matrix element is obtained as the real-space overlap over the density, $\rho$ , and potential, $\phi$ , basis functions on the different layers.

V_{i,k}(\mathbf{q}_{\parallel})=\int\rho_{i}(z,\mathbf{q}_{\parallel})\phi_{k}% (z,\mathbf{q}_{\parallel})\mathrm{d}z.

(113)

From the density-response function, the dielectric function is obtained in the basis of monopole/dipole perturbations on each layer in the heterostructure. While building blocks pre-computed with GPAW for a large variety of 2D materials are provided with the QEH package, GPAW offers the possibility of calculating custom building blocks for any 2D material as explained in Ref. [310].

As an illustrative example of the QEH model, we show how the static dielectric function of a heterostructure can be engineered by multi-layer stacking. Fig. 17 shows the static dielectric function of a vdW heterostructure made up by stacking $N$ MoS₂ layers and $N$ WSe₂ layers. We see that the dielectric function increases significantly as a function of the number of layers, eventually approaching a bulk limit. The knowledge of the layer-dependence of the dielectric response for such a heterostructeres could be further exploited to investigate inter- and intra-layer excitonic properties and band-edge renormalization effects.

IX.8 Solvent models

The presence of a solvent has a large effect on the energetics and the electronic structure of molecules or extended surfaces. In particular, the arguably most important solvent, water, is able to stabilize ions or zwitterions that would not form in the gas phase. The main effect relates to the large permittivity of water ( $\varepsilon_{r}=78$ ) that effectively screens Coulomb interactions.

A convenient and computationally lean method to describe this effect is the inclusion of a position-dependent solvent permittivity $\varepsilon(\mathbf{r})$ in the electrostatics via the Poisson solver [311]. The solvent is represented solely as a polarizable continuum that averages out all movements and re-arrangements of the solvent molecules and their electrons. The computational cost is, therefore, practically the same as a calculation in vacuum.

This implementation allows the calculation of solvation free energies of neutral and ionic species in solution [311]. Further, it can be applied to periodic surfaces interfaced with an electrolyte to reproduce reasonable potential drops within the simulation of electrochemical reaction processes, as we will elaborate in the following.

IX.9 Charged electrochemical interfaces

Simulating atomistic processes at a solid–liquid interface held at a controlled electrode potential is most appropriately performed in the electronically grand-canonical ensemble [312, 313, 314, 315]. Here, electrons can be exchanged dynamically with an external electron reservoir at a well-defined electrochemical potential. In a periodic system, a non-zero net charge would lead to divergence of the energy; therefore, any fractional electrons that are added to or removed from the system must be compensated by an equal amount of counter charge. Several approaches able to account for this change in boundary conditions have recently been brought forward [316, 317, 318, 319, 320]. In GPAW, this is conveniently accomplished with the introduction of a jellium slab of equal and opposite charge to the needed electronic charge; the jellium is embedded in an implicit solvent localized in the vacuum region above the simulated surface (cf. IX.8). As a particular highlight of this Solvated Jellium Method (SJM) [321], as it is known in GPAW, we are able to localize the excess charge on only the top side of the simulated atomistic surface, which occurs naturally by introducing the jellium region solely in the top-side vacuum and electrostatically decoupling the two sides of the cell via a dipole correction. Both a purely implicit and hybrid explicit-implicit solvent can be applied.

In the SJM, the simulated electrode potential is a monotonic function of the number of electrons in the simulation; calculations can be run in either a constant-charge or a constant-potential ensemble. The electrode potential ( $\phi_{\mathrm{e}}$ ) within SJM is defined as the Fermi level ( $\mu$ ) referenced to an electrostatic potential deep in the solvent (the solution inner potential $\Phi_{\mathrm{w}}$ ), where the whole charge on the electrode has been screened and no electric field is present,

\phi_{\mathrm{e}}=\Phi_{\mathrm{w}}-\mu.

(114)

We can relate $\phi_{\mathrm{e}}$ to the commonly used reference potentials, for example to the standard hydrogen electrode, by subtracting its absolute potential such as the experimental value of 4.44 V as reported by Trasatti [322]. In practice, the reference potentials depend on the used solvent model [323], and the reference can be calibrated using computed and measured potentials of zero charge [324].

The energy used in the analysis of electrode reactions is the grand-potential energy $\Omega$ ,

\Omega\equiv E_{\mathrm{tot}}+\phi_{\mathrm{e}}N_{\mathrm{e}},

(115)

where $N_{\mathrm{e}}$ are the excess electrons in the simulation. This allows for energetic comparisons between calculations with different numbers of electrons, and is the default energy returned to atomistic methods by SJM. While $E_{\mathrm{tot}}$ is consistent with the forces in traditional electronic-structure calculations, the grand-potential energy $\Omega$ is consistent with the forces in constant-potential simulations [325, 315]. This means that relaxations that follow forces will correctly find local minima in $\Omega$ , and any kind of structure optimization or dynamical routine can be performed on the grand-canonical potential energy surface, such as the search for saddle points [326, 327, 328] or molecular-dynamics simulations [329].

In constant-potential mode, the potential is controlled by a damped iterative technique that varies $N_{\mathrm{e}}$ to find the target $\phi_{\mathrm{e}}$ ; in practice, a trajectory (such as a relaxation or nudged elastic band) is run, where in a first series of SCF cycles the potential equilibrates. Upon achieving a target potential within the given threshold, the code will conduct the chosen geometry-optimization routine under constant potential control.

IX.10 Constrained DFT

Constrained DFT (cDFT) [330, 331, 332] is a computationally efficient method for constructing diabatic or charge/spin-localized states. GPAW includes a real-space implementation of cDFT [333], which can be used in both the FD and LCAO modes. Compared to most cDFT implementations, in GPAW the periodicity can be chosen flexibly between isolated molecules and periodic systems in one-, two-, or three-dimensions with $\mathbf{k}$ -point sampling.

The key difference of cDFT compared to normal DFT is the introduction of an auxiliary potential to force a certain region (in real space around a molecule, molecular fragment, or atom) to carry a predefined charge or spin. This leads to a modified energy functional

\begin{split}F[n(\mathbf{r}),\{V_{i}\}]&=E^{\mathrm{KS}}[n(\mathbf{r})]+\\ &\sum_{i}V_{i}\sum_{\sigma}\left(\int\mathrm{d}\mathbf{r}w^{\sigma}_{i}(% \mathbf{r})n^{\sigma}(\mathbf{r})-N_{i}\right),\end{split}

(116)

where $E^{\mathrm{KS}}$ is the Kohn–Sham energy functional, $\sigma$ denotes the spin, $n^{\sigma}(\mathbf{r})$ is the spin-dependent electron density, and $N_{i}$ is the predefined charge or spin constraint. $V_{i}$ acts as Lagrange multiplier, which determines the strength of the auxiliary potential and needs to be determined self-consistently as discussed below. $w^{\sigma}_{i}(\mathbf{r})$ is the weight function that defines how the charge or spin are to be partitioned, i.e. the regions where charge/spin is to be localised. This will be discussed in some detail below.

Introducing the constraining term in Eq. (116) leads to a new localized, spin-dependent external potential

v_{\text{eff}}^{\sigma}(\mathbf{r})=\dfrac{\delta F[n(\mathbf{r})]}{\delta n(% \mathbf{r})}=\dfrac{\delta E^{\mathrm{KS}}[n(\mathbf{r})]}{\delta n(\mathbf{r}% )}+\sum_{i}V_{i}\sum_{\sigma}w^{\sigma}_{i}(\mathbf{r}).

(117)

The constraint is further enforced by demanding that the ${V_{i}}$ need to satisfy the chosen constraints

C\geq\bigg{\lvert}\sum_{\sigma}\int\mathrm{d}\mathbf{r}w^{\sigma}_{i}(\mathbf{% r})n^{\sigma}(\mathbf{r})-N_{i}\bigg{\rvert}.

(118)

In practice, the strength of the constraining potential ${V_{i}}$ is found through a self-consistent two-stage optimization of both $\{V_{i}\}$ and $n(\mathbf{r})$ . As the derivatives of $F[n(\mathbf{r}),\{V_{i}\}]$ with respect to ${V_{i}}$ are readily available [333], gradient-based optimization algorithms in SciPy [68] are used for optimizing $\{V_{i}\}$ . The weight functions are defined by a Hirshfeld-type partitioning scheme with Gaussian atomic densities, and $w^{\sigma}_{i}(\mathbf{r})$ and the resulting external potential are presented on the grid. With these definitions, the forces resulting from the cDFT external potential can be analytically computed and used in e.g. geometry optimization or molecular-dynamics simulations.

cDFT has been widely used for computing transfer rates within Marcus theory, which depends on the reorganization and reaction (free) energies and the diabatic coupling matrix element; the GPAW-cDFT implementation includes all the needed tools for obtaining these parameters for bulk, surface, and molecular systems [333, 334]. Recently, the cDFT approach has been combined with molecular-dynamics methods to compute the reorganization energy at electrochemical interfaces [335] as well as with grand-canonical DFT methods (see section IX.9) to construct fixed electron-potential diabatic states [336].

IX.11 Orbital-free DFT

Orbital-free DFT (OFDFT) approximates the DFT energy functional by modelling the kinetic energy as a direct functional of the density

\begin{split}E_{\text{OF}}[n]=\underbrace{\int\text{d}\textbf{r}\,n^{1/2}(% \textbf{r})\left(-\frac{1}{2}\nabla^{2}\right)\,n^{1/2}(\textbf{r})}_{T_{\text% {W}}[n]}+J[n]+V[n]\\ +E_{\text{xc}}[n]+T_{\text{s}}[n]-T_{\text{W}}[n].\end{split}

(119)

Levy and colleagues showed that a Kohn–Sham-like equation derived variationally from the equation above holds for the square-root of the density [337]

\left(-\frac{1}{2}\nabla^{2}+V_{\text{eff}}(\textbf{r})\right)\,n^{1/2}(% \textbf{r})=\mu\,n^{1/2}(\textbf{r}).

(120)

OFDFT approximately enforces the Pauli principle, partially accounting for quantum effects in an averaged way.

The OFDFT scheme implemented in GPAW offers the advantage of accessing all-electron values while maintaining computational linear-scaling time with respect to system size. To achieve this, we employ the PAW method in conjunction with real-space methods obtaining a mean absolute error of 10 meV per atom when compared to reference all-electron values [338].

While OFDFT functionals perform better using local pseudopotentials in bulk materials, the OFDFT PAW implementation can be interesting for assessing density functionals. For example, in studying large- $Z$ limits or semiclassical limits of density functionals, the all-electron values allow us to find highly-performing OFDFT functionals [339].

IX.12 Zero-field splitting

The zero-field splitting (ZFS) refers to the energetic splitting of the magnetic sub-levels of a localized triplet state in the absence of a magnetic field [340]. The origin of the ZFS is the magnetic dipole-dipole interactions between the two electrons of the triplet. This interaction is described by a spin Hamiltonian of the form ( $\alpha,\beta=x,y,z$ ) [341, 342]

\hat{H}_{\mathrm{ZFS}}=\sum_{\alpha\beta}\hat{S}_{a}D_{\alpha\beta}\hat{S}_{% \beta},

(121)

where $\mathbf{\hat{S}}$ is the total spin operator and $\mathbf{D}$ is the ZFS tensor given by

\displaystyle D_{\alpha\beta}

\displaystyle=\frac{1}{2}\frac{\mu_{0}}{4\pi}g_{e}^{2}\mu_{\mathrm{B}}^{2}\int% \rho_{2}(\mathbf{r}_{1},\mathbf{r}_{2})\frac{\delta_{\alpha\beta}{r}^{2}-3r_{% \alpha}r_{\beta}}{r^{5}}\mathrm{d}\mathbf{r}_{1}\mathrm{d}\mathbf{r}_{2}.

(122)

where $r_{\alpha}$ and $r_{\beta}$ denote the Cartesian components of $\mathbf{r}=\mathbf{r}_{1}-\mathbf{r}_{2}$ , $\rho_{2}$ is the two-particle density matrix of the Kohn–Sham ground-state Slater determinant, $\mu_{\mathrm{B}}$ is the Bohr magneton and $g_{e}$ is the Landé splitting factor. GPAW computes the $\mathbf{D}$ -tensor by evaluating the double integral in reciprocal space using the pseudo density including compensation charges [305].

IX.13 Hyperfine coupling

The hyperfine coupling describes the interaction between a magnetic dipole of a nuclear spin, $\mathbf{\hat{I}}^{N}$ , and the magnetic dipole of the electron-spin distribution, $\mathbf{\hat{S}}(\mathbf{r})$ . The interaction is described by the spin Hamiltonian ( $\alpha,\beta=x,y,z$ )

\hat{H}_{\mathrm{HF}}^{N}=\sum_{\alpha\beta}\hat{S}_{\alpha}A_{\alpha\beta}^{N% }\hat{I}^{N}_{\beta}

(123)

where the hyperfine tensor of nucleus $N$ at $\mathbf{R}=\mathbf{0}$ is given by [343]

	$\displaystyle A^{N}_{\alpha\beta}$	$\displaystyle=\frac{2\mu_{0}}{3}g_{e}\mu_{B}g_{N}\mu_{N}\int\delta_{T}(\mathbf% {r})\rho_{s}(\mathbf{r})\mathrm{d}\mathbf{r}$		(124)
		$\displaystyle+\frac{\mu_{0}}{4\pi}g_{e}\mu_{B}g_{N}\mu_{N}\int\frac{3r_{\alpha% }r_{\beta}-\delta_{\alpha\beta}r^{2}}{r^{5}}\rho_{s}(\mathbf{r})\mathrm{d}% \mathbf{r}.$		(124)

The first term is the isotropic Fermi contact term, which is proportional to the spin density, $\rho_{s}(\mathbf{r})$ , at the centre of the nucleus. $\delta_{T}(\mathbf{r})$ is a smeared-out $\delta$ -function. $g_{e}$ and $g_{N}$ are the gyromagnetic ratios for the electron and nucleus, and $\mu_{N}$ is the nuclear magneton. The second term represents the anisotropic part of the hyperfine coupling tensor and results from dipole-dipole interactions between nuclear and electronic magnetic moments. GPAW evaluates $A^{N}$ using the pseudo spin density with compensation charges [305].

X Outlook

As described in this review, GPAW is a highly versatile code that is both maintenance-, user-, and developer-friendly at the same time. The continued expansion of the code requires substantial efforts and can be lifted only because of the dedicated team of developers contributing at all levels. There are currently a number of ongoing as well as planned developments for GPAW, which will further improve performance and applicability of the code. We are currently finishing a major refactoring of the code, which will make it even more developer friendly and facilitate easier implementation of new functionality.

Another priority is to improve the parallelization of hybrid functional calculations in plane wave mode by enabling parallelization over bands and $\mathbf{k}$ -points. In the same vein, there is ongoing work to support LCAO-based hybrid functional calculations using a resolution-of-identity approach. A natural next step would then be LCAO-based GW calculations. Such a method could potentially be very efficient compared to plane wave calculations, but it is currently unclear if the accuracy can be maintained with the limited LCAO basis. In relation to quasiparticle calculations, there are plans to implement (quasiparticle) self-consistent GW and vertex corrected GW using nonlocal xc-kernels from TDDFT. Constrained RPA calculations that provide a partially screened Coulomb interaction useful for ab initio calculation of interaction parameters in low-energy model Hamiltonians is currently being implemented.

Underlying any GPAW calculation are the PAW potentials. The current potentials date back to 2009. A new set of potentials, including both soft and norm-conserving potentials (for response function calculations), is being worked on.

As described herein, GPAW already has an efficient implementation of real time TDDFT in the LCAO basis while Ehrenfest dynamics is supported only the comparatively slower in grid mode. Work to enable Ehrenfest dynamics in LCAO mode is ongoing.

The current version of GPAW supports GPU acceleration only for standard ground state calculations. The CuPy library greatly simplifies the task of porting GPAW to GPU and we foresee that large parts of the code, including more advanced features such as linear response and GW calculations, will become GPU compatible.

XI Acknowledgements

K. S. T. acknowledges funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program Grant No. 773122 (LIMA) and Grant agreement No. 951786 (NOMAD CoE). K. S. T. is a Villum Investigator supported by VILLUM FONDEN (grant no. 37789). Funding for A. O. D. and G. L. was provided by the Icelandic Research Fund (grant nos. 196279 and 217734, respectively). F. N has received funding from the European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No. 899987. M. M. M. was supported by the Academy of Finland (grant #338228). T. O. acknowledges support from the Villum foundation Grant No. 00029378. S. K. and K. W. J. acknowledge support from the VILLUM Center for Science of Sustainable Fuels and Chemicals, which is funded by the VILLUM Fonden research grant (9455). T. B. was funded by the Danish National Research Foundation (DNRF 146). J. S. acknowledges funding from the Independent Research Fund Denmark (DFF- FTP) through grant no. 9041-00161B. C. S. acknowledges support from the Swedish Research Council (VR) through Grant No. 2016-06059 and funding from the Horizon Europe research and innovation program of the European Union under the Marie Skłodowska-Curie grant agreement no. 101065117. Partially funded by the European Union. Views and opinions expressed are, however, those of the author(s) only and do not necessarily reflect those of the European Union or REA. Neither the European Union nor the granting authority can be held responsible for them. T. S. received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant agreement No. 756277-ATMEN). O. L.-A. have been supported by Minciencias and University of Antioquia (Colombia). K. T. W. was supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, Chemical Sciences, Geosciences, and Biosciences Division, Catalysis Science Program to the SUNCAT Center for Interface Science and Catalysis. G. K. acknowledges funding from V-Sustain: The VILLUM Centre for the Science of Sustainable Fuels and Chemicals (grant no. 9455). Additional funding: Knut and Alice Wallenberg Foundation (2019.0140; J. F. and P. E.), the Swedish Research Council (2020-04935; J. F. and P. E.), the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 838996 (T. R.) and 101065117 (C. S.). Computations were enabled by resources provided by the National Academic Infrastructure for Supercomputing in Sweden (NAISS) at NSC, PDC, and C3SE partially funded by the Swedish Research Council through grant agreement no. 2022-06725.

References

[1] Pierre Hohenberg and Walter Kohn. Inhomogeneous electron gas. Phys. Rev., 136(3B):B864, 1964.
[2] Walter Kohn and Lu Jeu Sham. Self-consistent equations including exchange and correlation effects. Phys. Rev., 140(4A):A1133, 1965.
[3] John P Perdew, Kieron Burke, and Matthias Ernzerhof. Generalized gradient approximation made simple. Phys. Rev. Lett., 77(18):3865, 1996.
[4] Axel D Becke. Density-functional thermochemistry. iii. the role of exact exchange. The Journal of Chemical Physics, 98:5648–5652, 1993.
[5] Jochen Heyd, Gustavo E Scuseria, and Matthias Ernzerhof. Hybrid functionals based on a screened coulomb potential. The Journal of Chemical Physics, 118(18):8207–8215, 2003.
[6] Glenn M Torrie and John P Valleau. Nonphysical sampling distributions in monte carlo free-energy estimation: Umbrella sampling. J. Comput. Phys., 23(2):187–199, 1977.
[7] Petros Souvatzis, Olle Eriksson, MI Katsnelson, and SP Rudin. Entropy driven stabilization of energetically unstable crystal structures explained from first principles theory. Phys. Rev. Lett., 100(9):095901, 2008.
[8] Mark S. Hybertsen and Steven G. Louie. Electron correlation in semiconductors and insulators: Band gaps and quasiparticle energies. Phys. Rev. B, 34:5390–5413, Oct 1986.
[9] Dorothea Golze, Marc Dvorak, and Patrick Rinke. The GW compendium: A practical guide to theoretical photoemission spectroscopy. Front. Chem., 7:377, 2019.
[10] Mark E Casida, Christine Jamorski, Kim C Casida, and Dennis R Salahub. Molecular excitation energies to high-lying bound states from time-dependent density-functional response theory: Characterization and correction of the time-dependent local density approximation ionization threshold. The Journal of Chemical Physics, 108(11):4439–4449, 1998.
[11] Giovanni Onida, Lucia Reining, and Angel Rubio. Electronic excitations: density-functional versus many-body green’s-function approaches. Rev. Mod. Phys., 74(2):601, 2002.
[12] Jens Kehlet Nørskov, Thomas Bligaard, Jan Rossmeisl, and Claus Hviid Christensen. Towards the computational design of solid catalysts. Nat. Chem., 1(1):37–46, 2009.
[13] Anubhav Jain, Shyue Ping Ong, Geoffroy Hautier, Wei Chen, William Davidson Richards, Stephen Dacek, Shreyas Cholia, Dan Gunter, David Skinner, Gerbrand Ceder, et al. Commentary: The materials project: A materials genome approach to accelerating materials innovation. APL Mater., 1(1):011002, 2013.
[14] Stefano Curtarolo, Gus LW Hart, Marco Buongiorno Nardelli, Natalio Mingo, Stefano Sanvito, and Ohad Levy. The high-throughput highway to computational materials design. Nat. Mater., 12(3):191–201, 2013.
[15] Sten Haastrup, Mikkel Strange, Mohnish Pandey, Thorsten Deilmann, Per S Schmidt, Nicki F Hinsche, Morten N Gjerding, Daniele Torelli, Peter M Larsen, Anders C Riis-Jensen, et al. The computational 2d materials database: high-throughput modeling and discovery of atomically thin crystals. 2D Mater., 5(4):042002, 2018.
[16] Ivano E Castelli, Thomas Olsen, Soumendu Datta, David D Landis, Søren Dahl, Kristian S Thygesen, and Karsten W Jacobsen. Computational screening of perovskite metal oxides for optimal solar light capture. Energ. Environ. Sci., 5(2):5814–5819, 2012.
[17] Andreas Marek, Volker Blum, Rainer Johanni, Ville Havu, Bruno Lang, Thomas Auckenthaler, Alexander Heinecke, Hans-Joachim Bungartz, and Hermann Lederer. The elpa library: scalable parallel eigenvalue solutions for electronic structure theory and computational science. Journal of Physics: Condensed Matter, 26(21):213201, 2014.
[18] Peitao Liu, Merzuk Kaltak, Jiří Klimeš, and Georg Kresse. Cubic scaling GW: Towards fast quasiparticle calculations. Phys. Rev. B, 94(16):165109, 2016.
[19] Anubhav Jain, Shyue Ping Ong, Wei Chen, Bharat Medasani, Xiaohui Qu, Michael Kocher, Miriam Brafman, Guido Petretto, Gian-Marco Rignanese, Geoffroy Hautier, et al. Fireworks: a dynamic workflow system designed for high-throughput applications. Concurr. Comp-pract. E., 27(17):5037–5059, 2015.
[20] Giovanni Pizzi, Andrea Cepellotti, Riccardo Sabatini, Nicola Marzari, and Boris Kozinsky. Aiida: automated interactive infrastructure and database for computational science. Nato. Sc. S. Ss. Iii. C. S., 111:218–230, 2016.
[21] Jens Jørgen Mortensen, Morten Gjerding, and Kristian Sommer Thygesen. MyQueue: Task and workflow scheduling system. J. Open Source Softw., 5(45):1844, 2020.
[22] Morten Gjerding, Thorbjørn Skovhus, Asbjørn Rasmussen, Fabian Bertoldo, Ask Hjorth Larsen, Jens Jørgen Mortensen, and Kristian Sommer Thygesen. Atomic simulation recipes: A python framework and library for automated workflows. Nato. Sc. S. Ss. Iii. C. S., 199:110731, 2021.
[23] Jonathan Schmidt, Mário RG Marques, Silvana Botti, and Miguel AL Marques. Recent advances and applications of machine learning in solid-state materials science. npj Comput. Mater., 5(1):83, 2019.
[24] Albert P Bartók, Mike C Payne, Risi Kondor, and Gábor Csányi. Gaussian approximation potentials: The accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett., 104(13):136403, 2010.
[25] Matthias Rupp, Alexandre Tkatchenko, Klaus-Robert Müller, and O Anatole Von Lilienfeld. Fast and accurate modeling of molecular atomization energies with machine learning. Phys. Rev. Lett., 108(5):058301, 2012.
[26] Andrew M Teale, Trygve Helgaker, Andreas Savin, Carlo Adamo, Bálint Aradi, Alexei V Arbuznikov, Paul W Ayers, Evert Jan Baerends, Vincenzo Barone, Patrizia Calaminici, et al. DFT exchange: sharing perspectives on the workhorse of quantum chemistry and materials science. Phys. Chem. Chem. Phys., 24(47):28700–28781, 2022.
[27] Xinguo Ren, Patrick Rinke, Christian Joas, and Matthias Scheffler. Random-phase approximation and its applications in computational chemistry and materials science. J. Mater. Sci., 47:7447–7471, 2012.
[28] Jianwei Sun, Adrienn Ruzsinszky, and John P Perdew. Strongly constrained and appropriately normed semilocal density functional. Phys. Rev. Lett., 115(3):036402, 2015.
[29] Thomas Olsen and Kristian S Thygesen. Random phase approximation applied to solids, molecules, and graphene-metal interfaces: From van der Waals to covalent bonding. Phys. Rev. B, 87(7):075111, 2013.
[30] Alexandre Tkatchenko and Matthias Scheffler. Accurate molecular van der Waals interactions from ground-state electron density and free-atom reference data. Phys. Rev. Lett., 102(7):073005, 2009.
[31] Max Dion, Henrik Rydberg, Elsebeth Schröder, David C Langreth, and Bengt I Lundqvist. Van der Waals density functional for general geometries. Phys. Rev. Lett., 92(24):246401, 2004.
[32] J. J. Mortensen, L. B. Hansen, and K. W. Jacobsen. Real-space grid implementation of the projector augmented wave method. Phys. Rev. B, 71:035109, Jan 2005.
[33] J Enkovaara, C Rostgaard, J J Mortensen, J Chen, M Dułak, L Ferrighi, J Gavnholt, C Glinsvad, V Haikola, H A Hansen, H H Kristoffersen, M Kuisma, A H Larsen, L Lehtovaara, M Ljungberg, O Lopez-Acevedo, P G Moses, J Ojanen, T Olsen, V Petzold, N A Romero, J Stausholm-Møller, M Strange, G A Tritsaris, M Vanin, M Walter, B Hammer, H Häkkinen, G K H Madsen, R M Nieminen, J K Nørskov, M Puska, T T Rantala, J Schiøtz, K S Thygesen, and K W Jacobsen. Electronic structure calculations with GPAW: a real-space implementation of the projector augmented-wave method. Journal of Physics: Condensed Matter, 22(25):253202, jun 2010.
[34] Michael Walter, Hannu Häkkinen, Lauri Lehtovaara, Martti Puska, Jussi Enkovaara, Carsten Rostgaard, and Jens Jørgen Mortensen. Time-dependent density-functional theory in the projector augmented-wave method. The Journal of Chemical Physics, 128(24):244101, June 2008.
[35] A. H. Larsen, M. Vanin, J. J. Mortensen, K. S. Thygesen, and K. W. Jacobsen. Localized atomic basis set in the projector augmented wave method. Phys. Rev. B, 80:195112, Nov 2009.
[36] Jun Yan, Jens J Mortensen, Karsten W Jacobsen, and Kristian S Thygesen. Linear density response function in the projector augmented wave method: Applications to solids, surfaces, and interfaces. Phys. Rev. B, 83(24):245122, 2011.
[37] Thomas Olsen and Kristian S Thygesen. Extending the random-phase approximation for electronic correlation energies: The renormalized adiabatic local density approximation. Phys. Rev. B, 86(8):081103, 2012.
[38] Falco Hüser, Thomas Olsen, and Kristian S Thygesen. Quasiparticle gw calculations for solids, molecules, and two-dimensional materials. Phys. Rev. B, 87(23):235132, 2013.
[39] Jun Yan, Karsten W Jacobsen, and Kristian S Thygesen. Optical properties of bulk semiconductors and graphene/boron nitride: The bethe-salpeter equation with derivative discontinuity-corrected density functional energies. Phys. Rev. B, 86(4):045208, 2012.
[40] Joachim Sødequist and Thomas Olsen. Type ii multiferroic order in two-dimensional transition metal halides from first principles spin-spiral calculations. 2D Mater., 10(3):035016, may 2023.
[41] Daniele Torelli and Thomas Olsen. Calculating critical temperatures for ferromagnetic order in two-dimensional materials. 2D Mater., 6:015028, 2018.
[42] Frederik L Durhuus, Thorbjørn Skovhus, and Thomas Olsen. Plane wave implementation of the magnetic force theorem for magnetic exchange constants: application to bulk fe, co and ni. Journal of Physics: Condensed Matter, 35(10):105802, jan 2023.
[43] Thorbjørn Skovhus and Thomas Olsen. Dynamic transverse magnetic susceptibility in the projector augmented-wave method: Application to Fe, Ni, and Co. Phys. Rev. B, 103:245110, 2021.
[44] Mads Kruse, Urko Petralanda, Morten N. Gjerding, Karsten W. Jacobsen, Kristian S. Thygesen, and Thomas Olsen. Two-dimensional ferroelectrics from high throughput computational screening. npj Comput. Mater., 9:45, 2023.
[45] Morten Niklas Gjerding, Alireza Taghizadeh, Asbjørn Rasmussen, Sajid Ali, Fabian Bertoldo, Thorsten Deilmann, Nikolaj Rørbæk Knøsgaard, Mads Kruse, Ask Hjorth Larsen, Simone Manti, Thomas Garm Pedersen, Urko Petralanda, Thorbjørn Skovhus, Mark Kamper Svendsen, Jens Jørgen Mortensen, Thomas Olsen, and Kristian Sommer Thygesen. Recent progress of the computational 2D materials database (C2DB). 2D Mater., 8:044002, 2021.
[46] Thomas Olsen, Erik Andersen, Takuya Okugawa, Daniele Torelli, Thorsten Deilmann, and Kristian S. Thygesen. Discovering two-dimensional topological insulators from high-throughput computations. Phys. Rev. Mater., 3:024005, 2019.
[47] Ask Hjorth Larsen, Jens Jørgen Mortensen, Jakob Blomqvist, Ivano E Castelli, Rune Christensen, Marcin Dułak, Jesper Friis, Michael N Groves, Bjørk Hammer, Cory Hargus, Eric D Hermes, Paul C Jennings, Peter Bjerre Jensen, James Kermode, John R Kitchin, Esben Leonhard Kolsbjerg, Joseph Kubal, Kristen Kaasbjerg, Steen Lysgaard, Jón Bergmann Maronsson, Tristan Maxson, Thomas Olsen, Lars Pastewka, Andrew Peterson, Carsten Rostgaard, Jakob Schiøtz, Ole Schütt, Mikkel Strange, Kristian S Thygesen, Tejs Vegge, Lasse Vilhelmsen, Michael Walter, Zhenhua Zeng, and Karsten W Jacobsen. The atomic simulation environment—a python library for working with atoms. Journal of Physics: Condensed Matter, 29(27):273002, jun 2017.
[48] Arash A Mostofi, Jonathan R Yates, Young-Su Lee, Ivo Souza, David Vanderbilt, and Nicola Marzari. wannier90: A tool for obtaining maximally-localised wannier functions. Comput. Phys. Commun., 178(9):685–699, 2008.
[49] Yorick L. A. Schmerwitz, Gianluca Levi, and Hannes Jónsson. Calculations of Excited Electronic States by Converging on Saddle Points Using Generalized Mode Following. J. Chem. Theory Comput., 19(12):3634–3651, 2023.
[50] Aleksei V. Ivanov, Gianluca Levi, Elvar Ö. Jónsson, and Hannes Jónsson. Method for calculating excited electronic states using density functionals and direct orbital optimization with real space grid or plane-wave basis set. J. Chem. Theory Comput., 17:5034–5049, 8 2021.
[51] Gianluca Levi, Aleksei V. Ivanov, and Hannes Jónsson. Variational density functional calculations of excited states via direct optimization. J. Chem. Theory Comput., 16:6968–6982, 11 2020.
[52] M. Kuisma, A. Sakko, T. P. Rossi, A. H. Larsen, J. Enkovaara, L. Lehtovaara, and T. T. Rantala. Localized surface plasmon resonance in silver nanoparticles: Atomistic first-principles time-dependent density-functional theory calculations. Phys. Rev. B, 91(11):115431, 2015.
[53] Tuomas P. Rossi, Susi Lehtola, Arto Sakko, Martti J. Puska, and Risto M. Nieminen. Nanoplasmonics simulations at the basis set limit through completeness-optimized, local numerical basis sets. The Journal of Chemical Physics, 142(9):094114, 2015.
[54] Tuomas P. Rossi, Mikael Kuisma, Martti J. Puska, Risto M. Nieminen, and Paul Erhart. Kohn–Sham Decomposition in Real-Time Time-Dependent Density-Functional Theory: An Efficient Tool for Analyzing Plasmonic Excitations. J. Chem. Theory Comput., 13(10):4779–4790, 2017.
[55] Esko Makkonen, Tuomas P Rossi, Ask Hjorth Larsen, Olga Lopez-Acevedo, Patrick Rinke, Mikael Kuisma, and Xi Chen. Real-time time-dependent density functional theory implementation of electronic circular dichroism applied to nanoscale metal–organic clusters. The Journal of Chemical Physics, 154(11):114102, 2021.
[56] Alireza Taghizadeh, Ulrik Leffers, Thomas G Pedersen, and Kristian S Thygesen. A library of ab initio raman spectra for automated identification of 2d materials. Nat. Commun., 11(1):3011, 2020.
[57] Alireza Taghizadeh, Kristian S Thygesen, and Thomas G Pedersen. Two-dimensional materials with giant optical nonlinearities near the theoretical upper limit. Acs Nano, 15(4):7155–7167, 2021.
[58] Mikkel Ohm Sauer, Alireza Taghizadeh, Urko Petralanda, Martin Ovesen, Kristian Sommer Thygesen, Thomas Olsen, Horia Cornean, and Thomas Garm Pedersen. Shift current photovoltaic efficiency of 2d materials. npj Comput. Mater., 9(1):35, 2023.
[59] GPAW’s git-repository: https://gitlab.com/gpaw/gpaw/.
[60] Holger Krekel, Bruno Oliveira, Ronny Pfannschmidt, Floris Bruynooghe, Brianna Laugher, and Florian Bruhin. pytest, 2004.
[61] GPAW tutorials and exercises: https://wiki.fysik.dtu.dk/gpaw/tutorialsexercises/tutorialsexercises.html.
[62] Matteo Frigo and Steven G. Johnson. The design and implementation of FFTW3. P. Ieee, 93(2):216–231, 2005. Special issue on “Program Generation, Optimization, and Platform Adaptation”.
[63] L. S. Blackford, J. Choi, A. Cleary, E. D’Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, and R. C. Whaley. ScaLAPACK Users’ Guide. Society for Industrial and Applied Mathematics, Philadelphia, PA, 1997.
[64] Miguel A.L. Marques, Micael J.T. Oliveira, and Tobias Burnus. Libxc: A library of exchange and correlation functionals for density functional theory. Comput. Phys. Commun., 183(10):2272–2281, 2012.
[65] Susi Lehtola, Conrad Steigemann, Micael J. T. Oliveira, and Miguel A. L. Marques. Recent developments in libxc — A comprehensive library of functionals for density functional theory. SoftwareX, 7:1–5, January 2018.
[66] Ask Hjorth Larsen, Mikael Kuisma, Joakim Löfgren, Yann Pouillon, Paul Erhart, and Per Hyldgaard. libvdwxc: a library for exchange–correlation functionals in the vdw-df family. Model. Simul. Mater. Sc., 25(6):065004, jun 2017.
[67] Charles R. Harris, K. Jarrod Millman, Stéfan J. van der Walt, Ralf Gommers, Pauli Virtanen, David Cournapeau, Eric Wieser, Julian Taylor, Sebastian Berg, Nathaniel J. Smith, Robert Kern, Matti Picus, Stephan Hoyer, Marten H. van Kerkwijk, Matthew Brett, Allan Haldane, Jaime Fernández del Río, Mark Wiebe, Pearu Peterson, Pierre Gérard-Marchant, Kevin Sheppard, Tyler Reddy, Warren Weckesser, Hameer Abbasi, Christoph Gohlke, and Travis E. Oliphant. Array programming with NumPy. Nature, 585(7825):357–362, September 2020.
[68] Pauli Virtanen, Ralf Gommers, Travis E. Oliphant, Matt Haberland, Tyler Reddy, David Cournapeau, Evgeni Burovski, Pearu Peterson, Warren Weckesser, Jonathan Bright, Stéfan J. van der Walt, Matthew Brett, Joshua Wilson, K. Jarrod Millman, Nikolay Mayorov, Andrew R. J. Nelson, Eric Jones, Robert Kern, Eric Larson, C J Carey, İlhan Polat, Yu Feng, Eric W. Moore, Jake VanderPlas, Denis Laxalde, Josef Perktold, Robert Cimrman, Ian Henriksen, E. A. Quintero, Charles R. Harris, Anne M. Archibald, Antônio H. Ribeiro, Fabian Pedregosa, Paul van Mulbregt, and SciPy 1.0 Contributors. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat. Methods, 17:261–272, 2020.
[69] P. E. Blöchl. Projector augmented-wave method. Phys. Rev. B, 50:17953–17979, Dec 1994.
[70] Toma Susi, Jacob Madsen, Ursula Ludacka, Jens Jørgen Mortensen, Timothy J. Pennycook, Zhongbo Lee, Jani Kotakoski, Ute Kaiser, and Jannik C. Meyer. Efficient first principles simulation of electron scattering factors for transmission electron microscopy. Ultramicroscopy, 197:16–22, February 2019.
[71] Jacob Madsen, Timothy J. Pennycook, and Toma Susi. ab initio description of bonding for transmission electron microscopy. Ultramicroscopy, 231:113253, December 2021.
[72] Jacob Madsen and Toma Susi. abTEM: transmission electron microscopy from first principles. Open Research Europe, 1(24):13015, 2021.
[73] G. Kresse and J. Furthmüller. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B, 54:11169–11186, Oct 1996.
[74] Nicola Marzari, David Vanderbilt, Alessandro De Vita, and M. C. Payne. Thermal contraction and disordering of the al(110) surface. Phys. Rev. Lett., 82:3296–3299, Apr 1999.
[75] Peter E. Blöchl, O. Jepsen, and O. K. Andersen. Improved tetrahedron method for brillouin-zone integrations. Phys. Rev. B, 49:16223–16233, Jun 1994.
[76] Susi Lehtola, Frank Blockhuys, and Christian Van Alsenoy. An overview of self-consistent field calculations within finite basis sets. Molecules, 25(5):1–23, 2020.
[77] Troy Van Voorhis and Martin Head-Gordon. A geometric approach to direct minimization. Mol. Phys., 100:1713–1721, 2002.
[78] M. C. Payne, M. P. Teter, D. C. Allan, T. A. Arias, and J. D. Joannopoulos. Iterative minimization techniques for ab initio total-energy calculations: Molecular dynamics and conjugate gradients. Rev. Mod. Phys., 64(4):1045–1097, 1992.
[79] Martin Head-Gordon and John A. Pople. Optimization of wave function and geometry in the finite basis Hartree-Fock method. J. Phys. Chem-us., 92(11):3063–3069, 1988.
[80] Aleksei V. Ivanov, Elvar Ö. Jónsson, Tejs Vegge, and Hannes Jónsson. Direct energy minimization based on exponential transformation in density functional calculations of finite and extended systems. Comput. Phys. Commun., 267:108047, 2021.
[81] PAW-XML specification. https://esl.cecam.org/data/paw-xml/.
[82] Xavier Gonze, Bernard Amadon, Gabriel Antonius, Frédéric Arnardi, Lucas Baguet, Jean-Michel Beuken, Jordan Bieder, François Bottin, Johann Bouchet, Eric Bousquet, Nils Brouwer, Fabien Bruneval, Guillaume Brunin, Théo Cavignac, Jean-Baptiste Charraud, Wei Chen, Michel Côté, Stefaan Cottenier, Jules Denier, Grégory Geneste, Philippe Ghosez, Matteo Giantomassi, Yannick Gillet, Olivier Gingras, Donald R. Hamann, Geoffroy Hautier, Xu He, Nicole Helbig, Natalie Holzwarth, Yongchao Jia, François Jollet, William Lafargue-Dit-Hauret, Kurt Lejaeghere, Miguel A.L. Marques, Alexandre Martin, Cyril Martins, Henrique P.C. Miranda, Francesco Naccarato, Kristin Persson, Guido Petretto, Valentin Planes, Yann Pouillon, Sergei Prokhorenko, Fabio Ricci, Gian-Marco Rignanese, Aldo H. Romero, Michael Marcus Schmitt, Marc Torrent, Michiel J. van Setten, Benoit Van Troeye, Matthieu J. Verstraete, Gilles Zérah, and Josef W. Zwanziger. The ABINIT project: Impact, environment and recent developments. Comput. Phys. Commun., 248:107042, 2020.
[83] François Jollet, Marc Torrent, and Natalie Holzwarth. Generation of projector augmented-wave atomic data: A 71 element validated table in the XML format. Comput. Phys. Commun., 185(4):1246–1254, 2014.
[84] C. Hartwigsen, S. Goedecker, and J. Hutter. Relativistic separable dual-space gaussian pseudopotentials from h to rn. Phys. Rev. B, 58:3641–3662, Aug 1998.
[85] Martin Schlipf and François Gygi. Optimization algorithm for the generation of oncv pseudopotentials. Comput. Phys. Commun., 196:36–44, 2015.
[86] CuPy. https://cupy.dev/.
[87] Header only porting. https://github.com/mlouhivu/hop/.
[88] Samuli Hakala, Ville Havu, Jussi Enkovaara, and Risto M. Nieminen. Parallel electronic structure calculations using multiple graphics processing units (gpus). Lect. Notes Comput. Sc., 7782:63–76, 2013.
[89] Samuli Hakala, Jussi Enkovaara, Ville Havu, Jun Yan, Lin Li, Chris O’Grady, and Risto M. Nieminen. Grid-based projector-augmented wave method. Method in Electronic Structure Calculations on Graphics Processing Units, 2015.
[90] John P. Perdew and Karla Schmidt. Jacob’s ladder of density functional approximations for the exchange-correlation energy. In AIP Conference Proceedings, volume 577, pages 1–20. AIP Publishing, July 2001.
[91] Jianmin Tao, John P. Perdew, Viktor N. Staroverov, and Gustavo E. Scuseria. Climbing the Density Functional Ladder: Nonempirical Meta-Generalized Gradient Approximation Designed for Molecules and Solids. Phys. Rev. Lett., 91(14):146401, September 2003.
[92] John P. Perdew and Yue Wang. Accurate and simple analytic representation of the electron-gas correlation energy. Phys. Rev. B, 45(23):13244–13249, June 1992.
[93] Guillermo Román-Pérez and José M. Soler. Efficient implementation of a van der waals density functional: Application to double-wall carbon nanotubes. Phys. Rev. Lett., 103:096102, Aug 2009.
[94] Oleg Gritsenko, Robert van Leeuwen, Erik van Lenthe, and Evert Jan Baerends. Self-consistent approximation to the Kohn-Sham exchange potential. Phys. Rev. A, 51(3):1944, 1995.
[95] M. Kuisma, J. Ojanen, J. Enkovaara, and T. T. Rantala. Kohn-Sham potential with discontinuity for band gap materials. Phys. Rev. B, 82(11):115106, 2010.
[96] Jun Yan, Karsten W. Jacobsen, and Kristian S. Thygesen. First-principles study of surface plasmons on Ag(111) and H/Ag(111). Phys. Rev. B, 84(23):235430, 2011.
[97] Jun Yan, Karsten W. Jacobsen, and Kristian S. Thygesen. Conventional and acoustic surface plasmons on noble metal surfaces: A time-dependent density functional theory study. Phys. Rev. B, 86(24):241404, 2012.
[98] J. Magnus Rahm, Christopher Tiburski, Tuomas P. Rossi, Ferry Anggoro Ardy Nugroho, Sara Nilsson, Christoph Langhammer, and Paul Erhart. A library of late transition metal alloy dielectric functions for nanophotonic applications. Adv. Funct. Mater., 30(35):2002122, 2020.
[99] Keshab Bashyal, Christopher K Pyles, Sajjad Afroosheh, Aneer Lamichhane, and Alexey T Zayak. Empirical optimization of DFT+U and HSE for the band structure of ZnO. Journal of Physics: Condensed Matter, 30(6):065501, jan 2018.
[100] M. K. Y. Chan and G. Ceder. Efficient band gap prediction for solids. Phys. Rev. Lett., 105:196403, Nov 2010.
[101] John P. Perdew. Density functional theory and the band gap problem. Int. J. Quantum Chem., 28(S19):497–523, 1985.
[102] Kevin J. May and Alexie M. Kolpak. Improved description of perovskite oxide crystal structure and electronic properties using self-consistent Hubbard $u$ corrections from ACBN0. Phys. Rev. B, 101:165117, Apr 2020.
[103] S. L. Dudarev, G. A. Botton, S. Y. Savrasov, C. J. Humphreys, and A. P. Sutton. Electron-energy-loss spectra and the structural stability of nickel oxide: An LSDA+U study. Phys. Rev. B, 57:1505–1509, Jan 1998.
[104] A. I. Liechtenstein, V. I. Anisimov, and J. Zaanen. Density-functional theory and strong interactions: Orbital ordering in mott-hubbard insulators. Phys. Rev. B, 52:R5467–R5470, Aug 1995.
[105] F Aryasetiawan, T Miyake, and R Sakuma. The constrained RPA method for calculating the Hubbard U from first-principles. The LDA+ DMFT approach to strongly correlated materials, 2011.
[106] Matteo Cococcioni and Stefano de Gironcoli. Linear response approach to the calculation of the effective interaction parameters in the LDA+U method. Phys. Rev. B, 71(3), jan 2005.
[107] Guy C. Moore, Matthew K. Horton, Alexander M. Ganose, Martin Siron, Edward Linscott, David D. O’Regan, and Kristin A. Persson. High-throughput determination of Hubbard U and Hund J values for transition metal oxides via linear response formalism, 2022.
[108] Lei Wang, Thomas Maxisch, and Gerbrand Ceder. Oxidation energies of transition metal oxides within the $\mathrm{GGA}+\mathrm{U}$ framework. Phys. Rev. B, 73:195107, May 2006.
[109] M F M Taib, D T Mustaffa, N H Hussin, M H Samat, A M M Ali, O H Hassan, and M Z A Yahya. First principles study on Zn doped MgO using Hubbard U correction. Materials Research Express, 6(9):094012, jul 2019.
[110] Maituo Yu, Shuyang Yang, Chunzhi Wu, and Noa Marom. Machine learning the Hubbard U parameter in DFT+U using Bayesian optimization. npj Comput. Mater., 6(1):180, 2020.
[111] Yoshihiro Tawada, Takao Tsuneda, Susumu Yanagisawa, Takeshi Yanai, and Kimihiko Hirao. A long-range-corrected time-dependent density functional theory. The Journal of Chemical Physics, 120(18):8425–8433, May 2004.
[112] Aron J. Cohen, Paula Mori-Sánchez, and Weitao Yang. Insights into Current Limitations of Density Functional Theory. Science, 321(5890):792–794, August 2008.
[113] Andreas Dreuw, Jennifer L. Weisman, and Martin Head-Gordon. Long-range charge-transfer excited states in time-dependent density functional theory require non-local exchange. J. Chem. Phys., 119(6):2943–2946, 2003.
[114] E. J. Baerends, O. V. Gritsenko, and R. van Meer. The Kohn–Sham gap, the fundamental gap and the optical gap: The physical meaning of occupied and virtual Kohn–Sham orbital energies. Phys. Chem. Chem. Phys., 15(39):16408–16425, September 2013.
[115] Stephan Kümmel. Charge-Transfer Excitations: A Challenge for Time-Dependent Density Functional Theory That Has Been Met. Adv. Energy Mater., 7(16):1700440, August 2017.
[116] Roi Baer, Ester Livshits, and Ulrike Salzner. Tuned Range-Separated Hybrids in Density Functional Theory. Annu. Rev. Phys. Chem., 61(1):85–109, 2010.
[117] Carlo Adamo and Vincenzo Barone. Toward chemical accuracy in the computation of NMR shieldings: The PBE0 model. Chem. Phys. Lett., 298(1–3):113–119, December 1998.
[118] Takeshi Yanai, David P Tew, and Nicholas C Handy. A new hybrid exchange–correlation functional using the Coulomb-attenuating method (CAM-B3LYP). Chem. Phys. Lett., 393(1–3):51–57, July 2004.
[119] Yoshinobu Akinaga and Seiichiro Ten-no. Range-separation by the Yukawa potential in long-range corrected density functional theory with Gaussian-type basis functions. Chem. Phys. Lett., 462(4–6):348–351, September 2008.
[120] Michael Seth and Tom Ziegler. Range-Separated Exchange Functionals with Slater-Type Functions. J. Chem. Theory Comput., 8(3):901–907, March 2012.
[121] Ester Livshits and Roi Baer. A well-tempered density functional theory of electrons in molecules. Phys. Chem. Chem. Phys., 9(23):2932–2941, June 2007.
[122] Rolf Würdemann. Berechnung optischer Spektren und Grundzustandseigenschaften neutraler und geladener Moleküle mittels Dichtefunktionaltheorie. PhD thesis, Universität Freiburg, Freiburg, Germany, July 2016.
[123] Rolf Würdemann and Michael Walter. Charge Transfer Excitations with Range Separated Functionals Using Improved Virtual Orbitals. J. Chem. Theory Comput., 14(7):3667–3676, July 2018.
[124] Perdew J. P. and A. Zunger. Self-interaction correction to density-functional approximations for many-electron systems. Phys. Rev. B, 23:5048, 1981.
[125] S. Klüpfel, P. J. Klüpfel, and H. Jónsson. Importance of complex orbitals in calculating the self-interaction corrected ground state of atoms. Phys. Rev. A, 84:050501(R), 2011.
[126] S. Lehtola and H. Jónsson. Variational, self-consistent implementation of the perdew-zunger self-interaction correction with complex optimal orbitals. J. Chem. Theory Comput., 10:5324, 2014.
[127] S. Lehtola, M. Head-Gordon, and H. Jónsson. Complex orbitals, multiple local minima, and symmetry breaking in perdew-zunger self-interaction corrected density functional theory calculations. J. Chem. Theory Comput., 12:3195, 2016.
[128] H. Jónsson. Simulation of surface processes. Proceedings of the National Academy of Sciences, 108:944, 2011.
[129] S. Klüpfel, P. Klüpfel, and H. Jónsson. The effect of the perdew-zunger self-interaction correction to density functionals on the energetics of small molecules. J. Chem. Phys., 137:124102, 2012.
[130] H. Gudmundsdóttir, Y. Zhang, P. M. Weber, and H. Jónsson. Self-interaction corrected density functional calculations of molecular rydberg states. J. Chem. Phys., 139:194102, 2013.
[131] A. V. Ivanov, T. Ghosh, E. Ö. Jónsson, and H. Jónsson. Mn dimer can be described accurately with density functional calculations when self-interaction correction is applied. J. Phys. Chem. Lett., 12:4240, 2021.
[132] H. Gudmundsdóttir, E. Ö. Jónsson, and H. Jónsson. Calculations of al dopant in alpha-quartz using a variational implementation of the perdew-zunger self-interaction correction. New J. Phys., 17:083006, 2015.
[133] Yorick L. A. Schmerwitz, Aleksei V. Ivanov, Elvar Ö. Jónsson, Hannes Jónsson, and Gianluca Levi. Variational density functional calculations of excited states: Conical intersection and avoided crossing in ethylene bond twisting. J. Phys. Chem. Lett., 13:3990–3999, 2022.
[134] J. J. Mortensen, K. Kaasbjerg, S. L. Frederiksen, J. K. Nørskov, J. P. Sethna, and K. W. Jacobsen. Bayesian Error Estimation in Density-Functional Theory. Phys. Rev. Lett., 95(21):216401, November 2005.
[135] Jess Wellendorff, Keld T. Lundgaard, Andreas Møgelhøj, Vivien Petzold, David D. Landis, Jens K. Nørskov, Thomas Bligaard, and Karsten W. Jacobsen. Density functionals for surface science: Exchange-correlation model development with Bayesian error estimation. Phys. Rev. B, 85:235149, 2012.
[136] Jess Wellendorff, Keld T. Lundgaard, Karsten W. Jacobsen, and Thomas Bligaard. mBEEF: An accurate semi-local Bayesian error estimation density functional. The Journal of Chemical Physics, 140(14):144107, April 2014.
[137] Keld T. Lundgaard, Jess Wellendorff, J. Voss, Karsten W. Jacobsen, and Thomas Bligaard. mBEEF-vdW: Robust fitting of error estimation density functionals. Phys. Rev. B, 93(23):235162, 2016.
[138] Andrew J. Medford, Jess Wellendorff, Aleksandra Vojvodic, Felix Studt, Frank Abild-Pedersen, Karsten W. Jacobsen, Thomas Bligaard, and Jens K. Nørskov. Assessing the reliability of calculated catalytic ammonia synthesis rates. Science, 345(6193):197–200, 2014.
[139] G Ciccotti, M Ferrario, and J P Ryckaert. Molecular dynamics of rigid systems in cartesian coordinates a general formulation. Mol. Phys., 47:1253–1264, 1982.
[140] Andrew A Peterson. Global optimization of adsorbate–surface structures while preserving molecular identity. Top. Catal., 57:40–53, 2014.
[141] E B Tadmor, G S Smith, N Bernstein, and E Kaxiras. Mixed finite element and atomistic formulation for complex crystals. Phys. Rev. B, 59:235, 1999.
[142] C G Broyden. The convergence of a class of double-rank minimization algorithms. J. I. Math. Appl., 6:76–90, 1970.
[143] R Fletcher. A new approach to variable metric algorithms. Comput. J., 13:317–322, 1970.
[144] D Goldfarb. A family of variable metric updates derived by variational means. Math. Comput., 24:23–26, 1970.
[145] D F Shanno. Conditioning of quasi-newton methods for function minimization. Math. Comput., 24:647–656, 1970.
[146] D Liu and J Nocedal. On the limited memory method for large scale optimization. Mathematical Programming B, 45:503–528, 1989.
[147] Erik Bitzek, Pekka Koskinen, Franz Gahler, Michael Moseler, and Peter Gumbsch. Structural relaxation made simple. Phys. Rev. Lett., 97:170201, 2006.
[148] D Packwood, J R Kermode, L Mones, N Bernstein, J Woolley, N Gould, C Ortner, and G Csanyi. A universal preconditioner for simulating condensed phase materials. J. Chem. Phys., 144:164109, 2016.
[149] Estefania Garijo del Rio, Jens J Mortensen, and Karsten W Jacobsen. Local bayesian optimizer for atomic structures. Phys. Rev. B, 100:104103, 2019.
[150] G Henkelman and H Jonsson. A dimer method for finding saddle points on high dimensional potential surfaces using only first derivatives. J. Chem. Phys., 111:7010, 1999.
[151] H Jonsson, G Mills, and K W Jacobsen. in ‘classical and quantum dynamics in condensed phase systems’, edited by b. j. berne, g. cicotti, and d. f. coker. World Scientific, 1998.
[152] G Henkelman and H Jonsson. Improved tangent estimate in the neb method for finding minimum energy paths and saddle points. J. Chem. Phys., 113:9978, 2000.
[153] G Henkelman, B P Uberuaga, and H Jonsson. A climbing-image neb method for finding saddle points and minimum energy paths.
[154] S Smidstrup, A Pedersen, K Stokbro, and H Jonsson. Improved initial guess for minimum energy path calculations. J. Chem. Phys., 140:214106, 2014.
[155] E L Kolsbjerg, M N Groves, and B Hammer. An automated nudged elastic band method. J. Chem. Phys., 145:094107, 2016.
[156] P Lindgren, G Kastlunger, and A A Peterson. Scaled and dynamic optimizations of nudged elastic bands. J. Chem. Theory Comput., 15:5787–5793, 2019.
[157] S Makri, C Ortner, and J R Kermode. A preconditioning scheme for minimum energy path finding methods. J. Chem. Phys., 150:094109, 2019.
[158] A A Peterson. Acceleration of saddle-point searches with machine learning. J. Chem. Phys., 145:074106, 2016.
[159] O Koistinen, F B Dagbjartsdottir, V Asgeirsson, A Vehtari, and H Jonsson. Nudged elastic band calculations accelerated with gaussian process regression. J. Chem. Phys., 147:152720, 2017.
[160] J A G Torres, P C Jennings, M H Hansen, J R Boes, and T Bligaard. Low–scaling algorithm for nudged elastic band calculations using a surrogate machine learning model. Phys. Rev. Lett., 122:156001, 2019.
[161] Manuel Šarić, Jan Rossmeisl, and Poul Georg Moses. Modeling the adsorption of sulfur containing molecules and their hydrodesulfurization intermediates on the Co-promoted MoS2 catalyst by DFT. J. Catal., 358:131–140, February 2018.
[162] Thomas M. Østergaard, Livia Giordano, Ivano E. Castelli, Filippo Maglia, Byron K. Antonopoulos, Yang Shao-Horn, and Jan Rossmeisl. Oxidation of ethylene carbonate on li metal oxide surfaces. The Journal of Physical Chemistry C, 122(19):10442–10449, April 2018.
[163] Logi Arnarson, Per S. Schmidt, Mohnish Pandey, Alexander Bagger, Kristian S. Thygesen, Ifan E. L. Stephens, and Jan Rossmeisl. Fundamental limitation of electrocatalytic methane conversion to methanol. Phys. Chem. Chem. Phys., 20(16):11152–11159, 2018.
[164] Hao Wan, Anders W. Jensen, María Escudero-Escribano, and Jan Rossmeisl. Insights in the oxygen reduction reaction: From metallic electrocatalysts to diporphyrins. ACS Catal., 10(11):5979–5989, April 2020.
[165] Gianluca Levi, Elisa Biasin, Asmus Ougaard Dohn, and Hannes Jónsson. On the interplay of solvent and conformational effects in simulated excited-state dynamics of a copper phenanthroline photosensitizer. Phys. Chem. Chem. Phys., 22:748–757, 2020.
[166] D A Wales and J P K Doye. Global optimization of clusters, crystals, and biomolecules. Science, 285:1368, 1999.
[167] S Goedecker. Minima hopping: An efficient search method for the global minimum of the potential energy surface of complex molecular systems. J. Chem. Phys., 120:9911, 2004.
[168] L B Vilhelmsen and B Hammer. A genetic algorithm for first principles global structure optimization of supported nano structures. The Journal of Chemical Physics, 141:044711, 2014.
[169] Mads-Peter V. Christiansen, Nikolaj Rønne, and Bjørk Hammer. Atomistic global optimization x: A python package for optimization of atomistic structures. The Journal of Chemical Physics, 157(5):054701, August 2022.
[170] Paul C. Jennings, Steen Lysgaard, Jens Strabo Hummelshøj, Tejs Vegge, and Thomas Bligaard. Genetic algorithms for computational materials discovery accelerated by machine learning. npj Comput. Mater., 5(1), April 2019.
[171] Malthe K. Bisbo and Bjørk Hammer. Efficient Global Structure Optimization with a Machine-Learned Surrogate Model. Phys. Rev. Lett., 124(8):086102, February 2020.
[172] Sami Kaappa, Casper Larsen, and Karsten Wedel Jacobsen. Atomic structure optimization with machine-learning enabled interpolation between chemical elements. Phys. Rev. Lett., 127(16), October 2021.
[173] Sami Kaappa, Estefanía Garijo del Río, and Karsten Wedel Jacobsen. Global optimization of atomic structures with gradient-enhanced gaussian process regression. Phys. Rev. B, 103(17), May 2021.
[174] Casper Larsen, Sami Kaappa, Andreas Lynge Vishart, Thomas Bligaard, and Karsten Wedel Jacobsen. Machine-learning enabled optimization of atomic structures using atoms with fractional existence, 2022.
[175] Ralf Wanzenböck, Marco Arrigoni, Sebastian Bichelmaier, Florian Buchner, Jesús Carrete, and Georg K. H. Madsen. Neural-network-backed evolutionary search for SrTiO $\_3$ (110) surface reconstructions. Digital Discovery, 1(5):703–710, 2022.
[176] Andreas Møgelhøj, André K. Kelkkanen, K. Thor Wikfeldt, Jakob Schiøtz, Jens Jørgen Mortensen, Lars G. M. Pettersson, Bengt I. Lundqvist, Karsten W. Jacobsen, Anders Nilsson, and Jens K. Nørskov. Ab initio van der Waals interactions in simulations of water alter structure from mainly tetrahedral to high-density-like. The Journal of Physical Chemistry B, 115(48):14149–14160, August 2011.
[177] Martin Hangaard Hansen and Jan Rossmeisl. pH in grand canonical statistics of an electrochemical interface. The Journal of Physical Chemistry C, 120(51):29135–29143, December 2016.
[178] A. O. Dohn, E. Ö. Jónsson, G. Levi, J. J. Mortensen, O. Lopez-Acevedo, K. S. Thygesen, K. W. Jacobsen, J. Ulstrup, N. E. Henriksen, K. B. Møller, and H. Jónsson. Grid-based projector augmented wave (GPAW) implementation of quantum mechanics/molecular mechanics (QM/MM) electrostatic embedding and application to a solvated diplatinum complex. J. Chem. Theory Comput., 13(12):6010–6022, 2017.
[179] Asmus Ougaard Dohn, Elvar Örn Jónsson, Kasper Skov Kjær, Tim Brandt van Driel, Martin Meedom Nielsen, Karsten Wedel Jacobsen, Niels Engholm Henriksen, and Klaus Braagaard Møller. Direct dynamics studies of a binuclear metal complex in solution: The interplay between vibrational relaxation, coherence, and solvent effects. J. Phys. Chem. Lett., 5(14):2414–2418, June 2014.
[180] Tim B. van Driel, Kasper S. Kjær, Robert W. Hartsock, Asmus O. Dohn, Tobias Harlang, Matthieu Chollet, Morten Christensen, Wojciech Gawelda, Niels E. Henriksen, Jong Goo Kim, Kristoffer Haldrup, Kyung Hwan Kim, Hyotcherl Ihee, Jeongho Kim, Henrik Lemke, Zheng Sun, Villy Sundström, Wenkai Zhang, Diling Zhu, Klaus B. Møller, Martin M. Nielsen, and Kelly J. Gaffney. Atomistic characterization of the active-site solvation dynamics of a model photocatalyst. Nat. Commun., 7(1), 2016.
[181] Gianluca Levi, Mátyás Pápai, Niels E. Henriksen, Asmus O. Dohn, and Klaus B. Møller. Solution structure and ultrafast vibrational relaxation of the ptpop complex revealed by $\delta$ scf-qm/mm direct dynamics simulations. J. Phys. Chem. C, 122:7100–7119, 4 2018.
[182] Kristoffer Haldrup, Gianluca Levi, Elisa Biasin, Peter Vester, Mads Goldschmidt Laursen, Frederik Beyer, Kasper Skov Kjær, Tim Brandt Van Driel, Tobias Harlang, Asmus O. Dohn, Robert J. Hartsock, Silke Nelson, James M. Glownia, Henrik T. Lemke, Morten Christensen, Kelly J. Gaffney, Niels E. Henriksen, Klaus B. Møller, and Martin M. Nielsen. Ultrafast X-Ray Scattering Measurements of Coherent Structural Dynamics on the Ground-State Potential Energy Surface of a Diplatinum Molecule. Phys. Rev. Lett., 122(6):63001, 2019.
[183] Asmus O. Dohn, Kasper S. Kjær, Tobias B. Harlang, Sophie E. Canton, Martin M. Nielsen, and Klaus B. Møller. Electron transfer and solvent-mediated electronic localization in molecular photocatalysis. Inorg. Chem., 55(20):10637–10644, October 2016.
[184] Elvar Örn Jónsson, Asmus Ougaard Dohn, and Hannes Jónsson. Polarizable embedding with a transferable H $\_2$ O potential function I: Formulation and tests on dimer. J. Chem. Theory Comput., 15(12):6562–6577, 2019.
[185] Asmus Ougaard Dohn, Elvar Örn Jónsson, and Hannes Jónsson. Polarizable embedding with a transferable H $\_2$ O potential function II: Application to H $\_2$ O clusters and liquid water. J. Chem. Theory Comput., 15(12):6578–6587, 2019.
[186] Thomas Olsen. Designing in-plane heterostructures of quantum spin Hall insulators from first principles: 1T’-MoS $\_2$ with adsorbates. Phys. Rev. B, 94:235106, 2016.
[187] A.I. Liechtenstein, M.I. Katsnelson, V.P Antropov, and V.A. Gubanov. Local spin density functional approach to the theory of exchange interactions in ferromagnetic metals and alloys. J. Magn. Magn. Mater., 67:65–74, 1987.
[188] N. D. Mermin and H. Wagner. Absence of Ferromagnetism or Antiferromagnetism in One- or Two-Dimensional Isotropic Heisenberg Models. Phys. Rev. Lett., 17:1133–1136, 1966.
[189] J. L. Lado and J. Fernández-Rossier. On the origin of magnetic anisotropy in two dimensional CrI 3. 2D Mater., 4:035002, 2017.
[190] Daniele Torelli, Kristian Sommer Thygesen, and Thomas Olsen. High throughput computational screening for 2D ferromagnetic materials: the critical role of anisotropy and local correlations. 2D Mater., 6:045018, 2019.
[191] Daniele Torelli, Hadeel Moustafa, Karsten W. Jacobsen, and Thomas Olsen. High-throughput computational screening for two-dimensional magnetic materials based on experimental databases of three-dimensional compounds. npj Comput. Mater., 6:158, 2020.
[192] U von Barth and L. Hedin. A local exchange-correlation potential for the spin polarized case. i. J. Phys. C Solid State Phys., 5:1629–1642, 1972.
[193] Giovanni Scalmani and Michael J. Frisch. A New Approach to Noncollinear Spin Density Functional Theory beyond the Local Density Approximation. J. Chem. Theory Comput., 8:2193–2196, 2012.
[194] Tae Yun Kim and Cheol-Hwan Park. Magnetic Anisotropy and Magnetic Ordering of Transition-Metal Phosphorus Trisulfides. Nano Lett., 21:10114–10121, 2021.
[195] D. Ceresoli, U. Gerstmann, A. P. Seitsonen, and F. Mauri. First-principles theory of orbital magnetization. Phys. Rev. B, 81(6):060409, 2010.
[196] A. J. P. Meyer and G. Asch. Experimental g’ and g values of Fe, Co, Ni, and their alloys. J. Appl. Phys., 32(3):S330–S333, 1961.
[197] Simon Foner. High-field antiferromagnetic resonance in Cr $\_{2}$ O $\_{3}$ . Phys. Rev., 130:183–197, 1963.
[198] T. A. Kaplan. Classical Spin-Configuration Stability in the Presence of Competing Exchange Forces. Phys. Rev., 116(4):888–889, nov 1959.
[199] L. M. Sandratskii. Energy band structure calculations for crystals with spiral magnetic structure. Phys. Status Solidi B., 136(1):167–180, 1986.
[200] LM Sandratskii. Insight into the dzyaloshinskii-moriya interaction through first-principles study of chiral magnetic structures. Phys. Rev. B, 96(2):024450, 2017.
[201] Ryogo Kubo. Statistical-Mechanical Theory of Irreversible Processes. I. General Theory and Simple Applications to Magnetic and Conduction Problems. J. Phys. Soc. Jpn., 12(6):570–586, 6 1957.
[202] E. K U Gross and Walter Kohn. Local density-functional theory of frequency-dependent linear response. Phys. Rev. Lett., 55:2850–2852, 1985.
[203] Erich Runge and E. K. U. Gross. Density-Functional Theory for Time-Dependent Systems. Phys. Rev. Lett., 52:997–1000, 1984.
[204] S. Sharma, J. K. Dewhurst, A. Sanna, and E. K. U. Gross. Bootstrap approximation for the exchange-correlation kernel of time-dependent density-functional theory. Phys. Rev. Lett., 107:186401, Oct 2011.
[205] AH MacDonald, SH Vosko, and PT Coleridge. Extensions of the tetrahedron method for evaluating spectral properties of solids. Journal of Physics C: Solid State Physics, 12(15):2991, 1979.
[206] Carlo A. Rozzi, Daniele Varsano, Andrea Marini, Eberhard K. U. Gross, and Angel Rubio. Exact coulomb cutoff technique for supercell calculations. Phys. Rev. B, 73:205119, May 2006.
[207] Falco Hüser, Thomas Olsen, and Kristian S Thygesen. How dielectric screening in two-dimensional crystals affects the convergence of excited-state calculations: Monolayer MoS $\_2$ . Phys. Rev. B, 88(24):245309, 2013.
[208] S. Latini, T. Olsen, and K. S. Thygesen. Excitons in van der waals heterostructures: The important role of dielectric screening. Phys. Rev. B, 92:245123, Dec 2015.
[209] Christopher E. Patrick and Kristian S. Thygesen. Hubbard-U corrected Hamiltonians for non-self-consistent random-phase approximation total-energy calculations: A study of ZnS, TiO $\_{2}$ , and NiO. Phys. Rev. B, 93:035133, Jan 2016.
[210] Per S. Schmidt and Kristian S. Thygesen. Benchmark database of transition metal surface and adsorption energies from many-body perturbation theory. The Journal of Physical Chemistry C, 122(8):4381–4390, 2018.
[211] Thomas Olsen, Jun Yan, Jens J. Mortensen, and Kristian S. Thygesen. Dispersive and covalent interactions between graphene and metal surfaces from the random phase approximation. Phys. Rev. Lett., 107:156401, Oct 2011.
[212] Filipp Furche and Troy Van Voorhis. Fluctuation-dissipation theorem density-functional theory. The Journal of Chemical Physics, 122(16), 04 2005. 164106.
[213] Thomas Olsen and Kristian S. Thygesen. Beyond the random phase approximation: Improved description of short-range correlation by a renormalized adiabatic local density approximation. Phys. Rev. B, 88:115131, Sep 2013.
[214] Thomas Olsen and Kristian S. Thygesen. Accurate ground-state energies of solids and molecules from time-dependent density-functional theory. Phys. Rev. Lett., 112:203001, May 2014.
[215] Christopher E. Patrick and Kristian S. Thygesen. Adiabatic-connection fluctuation-dissipation DFT for the structural properties of solids—The renormalized ALDA and electron gas kernels. The Journal of Chemical Physics, 143(10), 04 2015. 102802.
[216] Thomas Olsen, Christopher E. Patrick, Jefferson E. Bates, Adrienn Ruzsinszky, and Kristian S. Thygesen. Beyond the RPA and GW methods with adiabatic xc-kernels for accurate ground state and quasiparticle energies. npj Comput. Mater., 5(1):106, Nov 2019.
[217] Paweł Buczek, Arthur Ernst, and Leonid M. Sandratskii. Different dimensionality trends in the Landau damping of magnons in iron, cobalt, and nickel: Time-dependent density functional study. Phys. Rev. B, 84:174418, 2011.
[218] Léon Van Hove. Time-Dependent Correlations between Spins and Neutron Scattering in Ferromagnetic Crystals. Phys. Rev., 95(6):1374–1384, 9 1954.
[219] Thorbjørn Skovhus, Thomas Olsen, and Henrik M Rønnow. Influence of static correlation on the magnon dynamics of an itinerant ferromagnet with competing exchange interactions: First-principles study of mnbi. Phys. Rev. Mater., 6:054402, 5 2022.
[220] Thorbjørn Skovhus and Thomas Olsen. Magnons in antiferromagnetic bcc cr and ${\mathrm{cr}}\_{2}{\mathrm{o}}\_{3}$ from time-dependent density functional theory. Phys. Rev. B, 106:085131, Aug 2022.
[221] HN Rojas, Rex William Godby, and RJ Needs. Space-time method for ab initio calculations of self-energies and dielectric response functions of solids. Phys. Rev. Lett., 74(10):1827, 1995.
[222] Christoph Friedrich, Stefan Blügel, and Arno Schindlmayr. Efficient implementation of the g w approximation within the all-electron flapw method. Phys. Rev. B, 81(12):125102, 2010.
[223] Ivan Duchemin and Xavier Blase. Robust analytic-continuation approach to many-body gw calculations. J. Chem. Theory Comput., 16(3):1742–1756, 2020.
[224] Dario A Leon, Claudia Cardoso, Tommaso Chiarotti, Daniele Varsano, Elisa Molinari, and Andrea Ferretti. Frequency dependence in g w made simple using a multipole approximation. Phys. Rev. B, 104(11):115157, 2021.
[225] Filip A Rasmussen, Per S Schmidt, Kirsten T Winther, and Kristian S Thygesen. Efficient many-body calculations for two-dimensional materials using exact limits for the screened potential: Band gaps of MoS $\_2$ , H-BN, and phosphorene. Phys. Rev. B, 94(15):155406, 2016.
[226] Andrea Marini, Conor Hogan, Myrta Grüning, and Daniele Varsano. Yambo: an ab initio tool for excited state calculations. Comput. Phys. Commun., 180(8):1392–1403, 2009.
[227] Thomas Olsen, Simone Latini, Filip Rasmussen, and Kristian S. Thygesen. Simple Screened Hydrogen Model of Excitons in Two-Dimensional Materials. Phys. Rev. Lett., 116:056401, 2016.
[228] Thomas Olsen. Unified Treatment of Magnons and Excitons in Monolayer CrI $\_3$ from Many-Body Perturbation Theory. Phys. Rev. Lett., 127:166402, 2021.
[229] Zhen Li, Patrizio Graziosi, and Neophytos Neophytou. Deformation potential extraction and computationally efficient mobility calculations in silicon from first principles. Phys. Rev. B, 104(19):195201, 2021.
[230] Amy Y Liu and Andrew A Quong. Linear-response calculation of electron-phonon coupling parameters. Phys. Rev. B, 53(12):R7575, 1996.
[231] Feliciano Giustino. Electron-phonon interactions from first principles. Rev. Mod. Phys., 89:015003, Feb 2017.
[232] Michael Walter and Michael Moseler. Ab Initio Wavelength-Dependent Raman Spectra: Placzek Approximation and Beyond. J. Chem. Theory Comput., 16(1):576–586, January 2020.
[233] Ado Jorio, Mildred S Dresselhaus, Riichiro Saito, and Gene Dresselhaus. Raman spectroscopy in graphene related systems. John Wiley & Sons, 2011.
[234] Atsushi Togo, Laurent Chaput, Terumasa Tadano, and Isao Tanaka. Implementation strategies in phonopy and phono3py. J. Phys. Condens. Matter, 35(35):353001, 2023.
[235] Robert W. Boyd. Nonlinear Optics. Elsevier Science Publishing Co Inc., Amsterdam, 3rd edition, 2008.
[236] Alireza Taghizadeh, F. Hipolito, and T. G. Pedersen. Linear and nonlinear optical response of crystals using length and velocity gauges: Effect of basis truncation. Phys. Rev. B, 96:195413, 2017.
[237] Claudio Aversa and J. E. Sipe. Nonlinear optical susceptibilities of semiconductors: Results with a length-gauge analysis. Phys. Rev. B, 52:14636–14645, 1995.
[238] Arto Sakko, Tuomas P Rossi, and Risto M Nieminen. Dynamical coupling of plasmons and molecular excitations by hybrid quantum/classical calculations: time-domain approach. Journal of Physics: Condensed Matter, 26(31):315013, jul 2014.
[239] Tuomas P. Rossi, Kirsten T. Winther, Karsten W. Jacobsen, Risto M. Nieminen, Martti J. Puska, and Kristian S. Thygesen. Effect of edge plasmons on the optical properties of MoS $\_{2}$ monolayer flakes. Phys. Rev. B, 96:155407, Oct 2017.
[240] Priyank V. Kumar, Tuomas P. Rossi, Mikael Kuisma, Paul Erhart, and David J. Norris. Direct hot-carrier transfer in plasmonic catalysis. Faraday Discuss., 214:189–197, 2019.
[241] Priyank V. Kumar, Tuomas P. Rossi, Daniel Marti-Dafcik, Daniel Reichmuth, Mikael Kuisma, Paul Erhart, Martti J. Puska, and David J. Norris. Plasmon-Induced Direct Hot-Carrier Transfer at Metal–Acceptor Interfaces. Acs Nano, 13(3):3188–3195, 2019.
[242] Tuomas P. Rossi, Timur Shegai, Paul Erhart, and Tomasz J. Antosiewicz. Strong plasmon-molecule coupling at the nanoscale revealed by first-principles modeling. Nat. Commun., 10(1):3336, 2019.
[243] Tuomas P. Rossi, Paul Erhart, and Mikael Kuisma. Hot-Carrier Generation in Plasmonic Nanoparticles: The Importance of Atomic Structure. Acs Nano, 14(8):9963–9971, 2020.
[244] Jakub Fojt, Tuomas P. Rossi, Mikael Kuisma, and Paul Erhart. Hot-Carrier Transfer across a Nanoparticle–Molecule Junction: The Importance of Orbital Hybridization and Level Alignment. Nano Lett., 22(21):8786–8792, 2022.
[245] Daniel Sorvisto, Patrick Rinke, and Tuomas P. Rossi. Single-atom dopants in plasmonic nanocatalysts. The Journal of Physical Chemistry C, 127(18):8585–8590, 2023.
[246] K. Yabana and G. F. Bertsch. Time-dependent local-density approximation in real time. Phys. Rev. B, 54(7):4484–4487, 1996.
[247] Sami Malola, Lauri Lehtovaara, Jussi Enkovaara, and Hannu Häkkinen. Birth of the localized surface plasmon resonance in monolayer-protected gold nanoclusters. Acs Nano, 7(11):10263–10270, 2013.
[248] Tuomas Rossi, Paul Erhart, and Mikael Kuisma. Code for "hot-carrier generation in plasmonic nanoparticles: The importance of atomic structure". Zenodo.
[249] Jakub Fojt, Tuomas Rossi, Mikael Kuisma, and Paul Erhart. Code for "hot-carrier transfer across a nanoparticle-molecule junction: The importance of orbital hybridization and level alignment". Zenodo.
[250] Christian Schäfer and Göran Johansson. Shortcut to self-consistent light-matter interaction and realistic spectra from first principles. Phys. Rev. Lett., 128:156402, Apr 2022.
[251] Christian Schäfer. Polaritonic chemistry from first principles via embedding radiation reaction. J. Phys. Chem. Lett., 13(30):6905–6911, 2022. PMID: 35866694.
[252] Ari Ojanperä, Ville Havu, Lauri Lehtovaara, and Martti Puska. Nonadiabatic Ehrenfest molecular dynamics within the projector augmented-wave method. The Journal of Chemical Physics, 136(14):144103, April 2012. Publisher: American Institute of Physics.
[253] Ari Ojanperä, Arkady V. Krasheninnikov, and Martti Puska. Electronic stopping power from first-principles calculations with account for core electron excitations and projectile ionization. Phys. Rev. B, 89(3):035120, 2014.
[254] Olga A. Syzgantseva, Martti Puska, and Kari Laasonen. Charge Transfer at the Hybrid Interfaces in the Presence of Water: A Theoretical Study. The Journal of Physical Chemistry C, 119(51):28347–28352, December 2015.
[255] Christian Brand, Maxime Debiossac, Toma Susi, François Aguillon, Jani Kotakoski, Philippe Roncin, and Markus Arndt. Coherent diffraction of hydrogen through the 246 pm lattice of graphene. New J. Phys., 21(3):033004, March 2019.
[256] Evgeny A. Buntov and Anatoly F. Zatsepin. Carbon Bond Breaking under Ar+-Ion Irradiation in Dependence on sp Hybridization: Car–Parrinello, Ehrenfest, and Classical Dynamics Study. J. Phys. Chem. A, 124(44):9128–9132, November 2020.
[257] Yierpan Aierken, Ankit Agrawal, Meiling Sun, Marko Melander, Ethan J. Crumlin, Brett A. Helms, and David Prendergast. Revealing Charge-Transfer Dynamics at Electrified Sulfur Cathodes Using Constrained Density Functional Theory. J. Phys. Chem. Lett., 12(2):739–744, January 2021. Publisher: American Chemical Society.
[258] Emilio Artacho and David D. O’Regan. Quantum mechanics in an evolving Hilbert space. Phys. Rev. B, 95(11):115155, March 2017. Publisher: American Physical Society.
[259] Siesta: Recent developments and applications. The Journal of Chemical Physics, 152(20):204108, May 2020. Publisher: American Institute of Physics.
[260] Andreas Dreuw and Martin Head-Gordon. Failure of Time-Dependent Density Functional Theory for Long-Range Charge-Transfer Excited States: The Zincbacteriochlorin-Bacteriochlorin and Bacteriochlorophyll-Spheroidene Complexes. J. Am. Chem. Soc., 126(12):4007–4016, 2004.
[261] Issaka Seidu, Mykhaylo Krykunov, and Tom Ziegler. Applications of time-dependent and time-independent density functional theory to Rydberg transitions. J. Phys. Chem. A, 119(21):5107–5116, 2015.
[262] Benjamin G. Levine, Chaehyuk Ko, Jason Quenneville, and Todd J. Martínez. Conical intersections and double excitations in time-dependent density functional theory. Mol. Phys., 104:1039–1051, 3 2006.
[263] Sigeru Huzinaga and Catalina Arnau. Virtual Orbitals in Hartree-Fock Theory. Phys. Rev. A, 1(5):1285–1288, May 1970.
[264] Sigeru Huzinaga and Catalina Arnau. Virtual Orbitals in Hartree–Fock Theory. II. The Journal of Chemical Physics, 54(5):1948–1951, March 1971.
[265] Diptarka Hait and Martin Head-Gordon. Orbital optimized density functional theory for electronic excited states. J. Phys. Chem. Lett., 12:4517–4529, 5 2021.
[266] Giuseppe M.J. Barca, Andrew T.B. Gilbert, and Peter M.W. Gill. Simple models for difficult electronic excitations. J. Chem. Theory Comput., 14:1501–1509, 3 2018.
[267] Andrew T.B. Gilbert, Nicholas A. Besley, and Peter M.W. Gill. Self-consistent field calculations of excited states using the maximum overlap method (mom). J. Phys. Chem. A, 112:13164–13171, 12 2008.
[268] Gianluca Levi, Aleksei V. Ivanov, and Hannes Jónsson. Variational calculations of excited states via direct optimization of orbitals in DFT. Faraday Discuss., 224:448–466, 12 2020.
[269] Pierre François Loos, Massimiliano Comin, Xavier Blase, and Denis Jacquemin. Reference energies for intramolecular charge-transfer excitations. J. Chem. Theory Comput., 17:3666–3686, 6 2021.
[270] Tetsuo Katayama, Tae Kyu Choi, Dmitry Khakhulin, Asmus O Dohn, Christopher J. Milne, György Vankó, Zoltán Németh, Frederico A. Lima, Jakub Szlachetko, Tokushi Sato, Shunsuke Nozawa, Shin Ichi Adachi, Makina Yabashi, Thomas J. Penfold, Wojciech Gawelda, and Gianluca Levi. Atomic-scale observation of solvent reorganization influencing photoinduced structural dynamics in a copper complex photosensitizer. Chem. Sci., 14:2572–2584, 2023.
[271] He Ma, Marco Govoni, and Giulia Galli. Quantum simulations of materials on near term quantum computers. npj Comput. Mater., 6(1):85, December 2020.
[272] Mostafa Abedi, Gianluca Levi, Diana B Zederkof, Niels Engholm Henriksen, Mátyás Pápai, and Klaus B. Møller. Excited-State Solvation Structure of Transition Metal Complexes from Molecular Dynamics Simulations and Assessment of Partial Atomic Charge Methods. Phys. Chem. Chem. Phys., 21:4082–4095, 2019.
[273] Aleksei V. Ivanov, Yorick L. A. Schmerwitz, Gianluca Levi, and Hannes Jónsson. Electronic excitations of the charged nitrogen-vacancy center in diamond obtained using time-independent variational density functional calculations. SciPost Phys., 15:009, 2023.
[274] Theory of polarization of crystalline solids. Phys. Rev. B, 47:1651–1654, 1993.
[275] Raffaele Resta. Macroscopic polarization in crystalline dielectrics: the geometric phase approach. Rev. Mod. Phys., 66:899–915, 1994.
[276] Raffaele Resta and David Vanderbilt. Theory of Polarization: A Modern Approach. In Phys. Ferroelectr., volume 105, pages 31–68. 2007.
[277] Yasuhiro Yoneda, Koji Ohara, and Hajime Nagata. Local structure and phase transitions of KNbO 3. Jpn. J. Appl. Phys., 57:11UB07, 2018.
[278] Marco Gibertini and Nicola Marzari. Emergence of One-Dimensional Wires of Free Carriers in Transition-Metal-Dichalcogenide Nanostructures. Nano Lett., 15:6229–6238, 2015.
[279] Joachim Sødequist, Urko Petralanda, and Thomas Olsen. Abundance of second order topology in C 3 symmetric two-dimensional insulators. 2D Mater., 10:015009, 2023.
[280] Stefano Baroni, Stefano De Gironcoli, Andrea Dal Corso, and Paolo Giannozzi. Phonons and related crystal properties from density-functional perturbation theory. Rev. Mod. Phys., 73:515–562, 2001.
[281] Ruixiang Fei, Wei Kang, and Li Yang. Ferroelectricity and Phase Transitions in Monolayer Group-IV Monochalcogenides. Phys. Rev. Lett., 117:097601, 2016.
[282] Tonatiuh Rangel, Benjamin M Fregoso, Bernardo S Mendoza, Takahiro Morimoto, Joel E Moore, and Jeffrey B Neaton. Large Bulk Photovoltaic Effect and Spontaneous Polarization of Single-Layer Monochalcogenides. Phys. Rev. Lett., 119:067402, 2017.
[283] Hua Wang and Xiaofeng Qian. Two-dimensional multiferroics in monolayer group IV monochalcogenides. 2D Mater., 4:015042, 2017.
[284] Urko Petralanda and Thomas Olsen. Polarization switching induced by domain wall sliding in two-dimensional ferroelectric monochalcogenides. 2D Mater., 10:015001, 2023.
[285] Maryam Taherinejad, Kevin F. Garrity, and David Vanderbilt. Wannier center sheets in topological insulators. Phys. Rev. B, 89:115102, 2014.
[286] Nicola Marzari and David Vanderbilt. Maximally localized generalized wannier functions for composite energy bands. Phys. Rev. B, 56(20):12847, 1997.
[287] Xiaofeng Qian, Junwei Liu, Liang Fu, and Ju Li. Quantum spin Hall effect in two-dimensional transition metal dichalcogenides. Science, 346:1344–1347, 2014.
[288] Raffaele Resta and Sandro Sorella. Electron localization in the insulating state. Phys. Rev. Lett., 82(2):370, 1999.
[289] Gerd Berghold, Christopher J Mundy, Aldo H Romero, Jürg Hutter, and Michele Parrinello. General and efficient algorithms for obtaining maximally localized wannier functions. Phys. Rev. B, 61(15):10040, 2000.
[290] Kristian S Thygesen, Lars Bruno Hansen, and Karsten Wedel Jacobsen. Partly occupied wannier functions. Phys. Rev. Lett., 94(2):026405, 2005.
[291] Kristian Sommer Thygesen, Lars Bruno Hansen, and Karsten Wedel Jacobsen. Partly occupied wannier functions: Construction and applications. Phys. Rev. B, 72(12):125119, 2005.
[292] Pietro F Fontana, Ask H Larsen, Thomas Olsen, and Kristian S Thygesen. Spread-balanced wannier functions: Robust and automatable orbital localization. Phys. Rev. B, 104(12):125140, 2021.
[293] Cyrus E Dreyer, Audrius Alkauskas, John L Lyons, Anderson Janotti, and Chris G Van de Walle. First-principles calculations of point defects for quantum technologies. Annu. Rev. Mater. Res., 48:1–26, 2018.
[294] Ji Sang Park, Sunghyun Kim, Zijuan Xie, and Aron Walsh. Point defect engineering in thin-film solar cells. Nature Reviews Materials, 3(7):194–210, 2018.
[295] L Weston, D Wickramaratne, M Mackoit, A Alkauskas, and CG Van de Walle. Native point defects and impurities in hexagonal boron nitride. Phys. Rev. B, 97(21):214104, 2018.
[296] TQP Vuong, Guillaume Cassabois, Pierre Valvin, Abdelkarim Ouerghi, Yannick Chassagneux, Christophe Voisin, and Bernard Gil. Phonon-photon mapping in a color center in hexagonal boron nitride. Phys. Rev. Lett., 117(9):097402, 2016.
[297] Chris G Van de Walle and Jörg Neugebauer. First-principles calculations for defects and impurities: Applications to iii-nitrides. J. Appl. Phys., 95(8):3851–3879, 2004.
[298] Stephan Lany and Alex Zunger. Assessment of correction methods for the band-gap problem and for finite-size effects in supercell defect calculations: Case studies for zno and gaas. Phys. Rev. B, 78(23):235104, 2008.
[299] Christoph Freysoldt, Blazej Grabowski, Tilmann Hickel, Jörg Neugebauer, Georg Kresse, Anderson Janotti, and Chris G Van de Walle. First-principles calculations for point defects in solids. Rev. Mod. Phys., 86(1):253, 2014.
[300] Alex Zunger and Oleksandr I Malyi. Understanding doping of quantum materials. Chem. Rev., 121(5):3031–3060, 2021.
[301] Oleksandr O Kurakevych and Vladimir L Solozhenko. Rhombohedral boron subnitride, b13n2, by x-ray powder diffraction. Acta Crystallogr. C., 63(9):i80–i82, 2007.
[302] Guillaume Cassabois, Pierre Valvin, and Bernard Gil. Hexagonal boron nitride is an indirect bandgap semiconductor. Nat. Photonics, 10(4):262–266, 2016.
[303] Christoph Freysoldt, Jörg Neugebauer, and Chris G Van de Walle. Fully ab initio finite-size corrections for charged-defect supercell calculations. Phys. Rev. Lett., 102(1):016402, 2009.
[304] Sami Kaappa, Sami Malola, and Hannu Häkkinen. Point group symmetry analysis of the electronic structure of bare and protected metal nanocrystals. J. Phys. Chem. A, 122(43):8576–8584, 2018.
[305] Fabian Bertoldo, Sajid Ali, Simone Manti, and Kristian S. Thygesen. Quantum point defects in 2D materials - the QPOD database. npj Comput. Mater., 8(1):56, Apr 2022.
[306] J.F. Cornwell. Group Theory in Physics, volume 1 of Techniques of Physics. Academic Press, San Diego, 1997.
[307] Voicu Popescu and Alex Zunger. Extracting E versus $\vec{k}$ effective band structure from supercell calculations on alloys and impurities. Phys. Rev. B, 85(8):085201, 2012.
[308] Quantum Electrostatic Heterostructure Model. https://qeh.readthedocs.io/en/latest/, 2019. [Online; accessed 15-May-2023].
[309] Kirsten Andersen, Simone Latini, and Kristian S Thygesen. Dielectric genome of van der Waals heterostructures. Nano Lett., 15(7):4616–4621, 2015.
[310] Calculating new building blocks. https://qeh.readthedocs.io/en/latest/tutorials.html, 2019. [Online; accessed 15-May-2023].
[311] Alexander Held and Michael Walter. Simplified continuum solvent model with a smooth cavity based on volumetric data. The Journal of Chemical Physics, 141(17):174108, 2014.
[312] A. Y. Lozovoi, A Alavi, Jorge Kohanoff, and R M Lynden-Bell. Ab initio simulation of charged slabs at constant chemical potential. The Journal of Chemical Physics, 115(4):1661–1669, jul 2001.
[313] Marko M Melander, Mikael J Kuisma, Thorbjørn Erik Køppen Christensen, and Karoliina Honkala. Grand-canonical approach to density functional theory of electrocatalytic systems: Thermodynamics of solid-liquid interfaces at constant ion and electrode potentials. The Journal of Chemical Physics, 150(4):041706, jan 2019.
[314] Ken Sakaushi, Tomoaki Kumeda, Sharon Hammes-Schiffer, Marko M Melander, and Osamu Sugino. Advances and challenges for experiment and theory for multi-electron multi-proton transfer at electrified solid–liquid interfaces. Phys. Chem. Chem. Phys., 22(35):19401–19442, 2020.
[315] Per Lindgren, Georg Kastlunger, and Andrew A. Peterson. Electrochemistry from the atomic scale, in the electronically grand-canonical ensemble. The Journal of Chemical Physics, 157(18):180902, nov 2022.
[316] M Otani and O Sugino. First-principles calculations of charged surfaces and interfaces: A plane-wave nonrepeated slab approach. Phys. Rev. B, 73(11):115407, mar 2006.
[317] Kiran Mathew, Ravishankar Sundararaman, Kendra Letchworth-Weaver, T A Arias, and Richard G Hennig. Implicit solvation model for density-functional study of nanocrystal surfaces and reaction pathways. The Journal of Chemical Physics, 140(8):84106, 2014.
[318] Kendra Letchworth-Weaver and T. A. Arias. Joint density functional theory of the electrode-electrolyte interface: Application to fixed electrode potentials, interfacial capacitances, and potentials of zero charge. Phys. Rev. B, 86(7):075140, aug 2012.
[319] Nicolas G. Hörmann, Oliviero Andreussi, and Nicola Marzari. Grand canonical simulations of electrochemical interfaces in implicit solvation models. The Journal of Chemical Physics, 150(4):041730, jan 2019.
[320] Stefan Ringe, Nicolas G Hörmann, Harald Oberhofer, and Karsten Reuter. Implicit Solvation Methods for Catalysis at Electrified Interfaces. Chem. Rev., 122(12):10777–10820, jun 2022.
[321] Georg Kastlunger, Per Lindgren, and Andrew A. Peterson. Controlled-potential simulation of elementary electrochemical reactions: Proton discharge on metal surfaces. The Journal of Physical Chemistry C, 122(24):12771–12781, jun 2018.
[322] Sergio Trasatti. The absolute electrode potential: an explanatory note (recommendations 1986). Journal of Electroanalytical Chemistry and Interfacial Electrochemistry, 209(2):417–428, sep 1986.
[323] Ravishankar Sundararaman and Kathleen Schwarz. Evaluating continuum solvation models for the electrode-electrolyte interface: Challenges and strategies for improvement. The Journal of Chemical Physics, 146(8), 02 2017.
[324] Marko M. Melander, Mikael J. Kuisma, Thorbjørn Erik Køppen Christensen, and Karoliina Honkala. Grand-canonical approach to density functional theory of electrocatalytic systems: Thermodynamics of solid-liquid interfaces at constant ion and electrode potentials. The Journal of Chemical Physics, 150(4), 11 2018.
[325] M. Weinert and J. W. Davenport. Fractional occupations and density-functional energies and forces. Phys. Rev. B, 45:13709–13712, Jun 1992.
[326] Per Lindgren, Georg Kastlunger, and Andrew A Peterson. A Challenge to the G $\approx$ 0 Interpretation of Hydrogen Evolution. ACS Catal., 10(1):121–128, jan 2020.
[327] Georg Kastlunger, Lei Wang, Nitish Govindarajan, Hendrik H Heenen, Stefan Ringe, Thomas Jaramillo, Christopher Hahn, and Karen Chan. Using pH Dependence to Understand Mechanisms in Electrochemical CO Reduction. ACS Catal., 12(8):4344–4357, apr 2022.
[328] Georg Kastlunger, Hendrik H. Heenen, and Nitish Govindarajan. Combining First-Principles Kinetics and Experimental Data to Establish Guidelines for Product Selectivity in Electrochemical CO 2 Reduction. ACS Catal., 13(7):5062–5072, apr 2023.
[329] Marko Melander, Tongwei Wu, and Karoliina Honkala. Constant inner potential DFT for modelling electrochemical systems under constant potential and bias. ChemRxiv, 2023. DOI=10.26434/chemrxiv-2021-r621x-v3.
[330] Qin Wu and Troy Van Voorhis. Direct optimization method to study constrained systems within density-functional theory. Phys. Rev. A, 72:024502, Aug 2005.
[331] Qin Wu and Troy Van Voorhis. Extracting electron transfer coupling elements from constrained density functional theory. The Journal of Chemical Physics, 125(16), 10 2006.
[332] Benjamin Kaduk, Tim Kowalczyk, and Troy Van Voorhis. Constrained density functional theory. Chem. Rev., 112(1):321–370, 2012.
[333] Marko Melander, Elvar Ö. Jónsson, Jens J. Mortensen, Tejs Vegge, and Juan Maria García Lastra. Implementation of Constrained DFT for Computing Charge Transfer Rates within the Projector Augmented Wave Method. J. Chem. Theory Comput., 12(11):5367–5378, 2016.
[334] Haesun Park, Nitin Kumar, Marko Melander, Tejs Vegge, Juan Maria Garcia Lastra, and Donald J. Siegel. Adiabatic and Nonadiabatic Charge Transport in Li–S Batteries. Chem. Mater., 30(3):915–928, 2018.
[335] Tomoaki Kumeda, Laura Laverdure, Karoliina Honkala, Marko M. Melander, and Ken Sakaushi. Cations determine the mechanism and selectivity of alkaline ORR on Pt(111), 2022.
[336] Marko M. Melander. Grand canonical rate theory for electrochemical and electrocatalytic systems i: General formulation and proton-coupled electron transfer reactions. J. Electrochem. Soc., 167(11):116518, jul 2020.
[337] Mel Levy, John P. Perdew, and Viraht Sahni. Exact differential equation for the density and ionization energy of a many-particle system. Phys. Rev. A, 30:2745–2748, Nov 1984.
[338] Jouko Lehtomäki, Ilja Makkonen, Miguel A. Caro, Ari Harju, and Olga Lopez-Acevedo. Orbital-free density functional theory implementation with the projector augmented-wave method. The Journal of Chemical Physics, 141(23), 12 2014. 234102.
[339] Jouko Lehtomäki and Olga Lopez-Acevedo. Large-Z limit in atoms and solids from first principles. The Journal of Chemical Physics, 151(24), 12 2019. 244101.
[340] Viktor Ivády, Tamás Simon, Jeronimo R Maze, IA Abrikosov, and Adam Gali. Pressure and temperature dependence of the zero-field splitting in the ground state of nv centers in diamond: A first-principles study. Phys. Rev. B, 90(23):235205, 2014.
[341] MJ Rayson and PR Briddon. First principles method for the calculation of zero-field splitting tensors in periodic systems. Phys. Rev. B, 77(3):035119, 2008.
[342] Timur Biktagirov, Wolf Gero Schmidt, and Uwe Gerstmann. Spin decontamination for magnetic dipolar coupling calculations: Application to high-spin molecules and solid-state spin qubits. Physical Review Research, 2(2):022024, 2020.
[343] Peter E Blöchl. First-principles calculations of defects in oxygen-deficient silica exposed to hydrogen. Phys. Rev. B, 62(10):6158, 2000.