Theory and Modern Applications

# Newton-Kantorovich convergence theorem of a new modified Halley’s method family in a Banach space

## Abstract

A Newton-Kantorovich convergence theorem of a new modified Halley’s method family is established in a Banach space to solve nonlinear operator equations. We also present the main results to reveal the competence of our method. Finally, two numerical examples arising in the theory of the radiative transfer, neutron transport and in the kinetic theory of gasses are provided to show the application of our theorem.

## Introduction

In the last two centuries, remarkable contributions have been made to both the theory and application of nonlinear equations. Suppose that we have to find a solution of the nonlinear equation

$F\left(x\right)=0,$
(1)

where F is defined on an open convex subset Ω of a Banach space X with values in a Banach space Y.

These equations are increasingly used to model problems in engineering applications, such as material science, electrical engineering, civil engineering, chemical engineering, mechanics and numerical optimization. There are several iterative methods  used to find a solution of nonlinear equations. One of those iterative methods is the famous Newton’s method

${x}_{n+1}={x}_{n}-{F}^{\prime }{\left({x}_{n}\right)}^{-1}F\left({x}_{n}\right)\phantom{\rule{1em}{0ex}}\left(n\ge 0\right)\phantom{\rule{0.25em}{0ex}}\left({x}_{0}\in \mathrm{\Omega }\right)$
(2)

often used to solve the nonlinear operator equation under the reasonable hypotheses. However, Newton’s method is only the second-order convergence. Kantorovich presented the famous convergence result , and afterward, many Newton-Kantorovich-type convergence theorems have been attained . Furthermore, many deformed methods  have been presented to improve the convergence order. The famous Halley’s method, which has been widely discussed , is the third-order convergence. The famous Halley’s method is defined as

${x}_{n+1}={x}_{n}-\left[I+\frac{1}{2}{L}_{F}\left({x}_{n}\right){\left(I-\frac{1}{2}{L}_{F}\left({x}_{n}\right)\right)}^{-1}\right]{F}^{\prime }{\left({x}_{n}\right)}^{-1}F\left({x}_{n}\right),\phantom{\rule{1em}{0ex}}n=0,1,\dots ,$

where

${L}_{F}\left(x\right)={F}^{\prime }{\left(x\right)}^{-1}{F}^{″}\left(x\right){F}^{\prime }{\left(x\right)}^{-1}F\left(x\right),\phantom{\rule{1em}{0ex}}x\in \mathrm{\Omega }.$

Now, we consider Halley’s method with a parameter λ in the form

${x}_{\lambda ,n+1}={x}_{\lambda ,n}-\left[I+\frac{1}{2}{L}_{F}\left({x}_{\lambda ,n}\right){\left(I-\lambda {L}_{F}\left({x}_{\lambda ,n}\right)\right)}^{-1}\right]{F}^{\prime }{\left({x}_{\lambda ,n}\right)}^{-1}F\left({x}_{\lambda ,n}\right),\phantom{\rule{1em}{0ex}}n=0,1,\dots .$

One can see that Halley’s method and super-Halley’s method are the special cases for $\lambda =\frac{1}{2}$ and $\lambda =1$. In this method, in every step, one needs to compute the second order derivatives of the function F. The computing cost will be the high. To avoid the computation of ${F}^{″}\left({x}_{n}\right)$, and to maintain the high order convergence, many researchers have replaced the second order derivative with the first order divided differences. They presented the modified Halley’s methods with the parameters p, λ. Their modified Halley’s method is as follows :

$\left\{\begin{array}{c}{y}_{n}={x}_{n}-{F}^{\prime }{\left({x}_{n}\right)}^{-1}F\left({x}_{n}\right),\hfill \\ H\left({x}_{n},{y}_{n}\right)=\frac{1}{p}{F}^{\prime }{\left({x}_{n}\right)}^{-1}\left[{F}^{\prime }\left({x}_{n}+p\left({y}_{n}-{x}_{n}\right)\right)-{F}^{\prime }\left({x}_{n}\right)\right],\phantom{\rule{1em}{0ex}}\lambda \in \left[0,1\right],p\in \left(0,1\right],\hfill \\ {x}_{n+1}={y}_{n}-\frac{1}{2}H\left({x}_{n},{y}_{n}\right)\left[I-\lambda H\left({x}_{n},{y}_{n}\right)\right]\left({y}_{n}-{x}_{n}\right).\hfill \end{array}$
(3)

For $p=\frac{1}{2}$, $\lambda =0$, the method becomes Chebyshev’s iterative method (see ). For $p=\frac{2}{3}$, $\lambda =1$, the method becomes inverse-free Jarratt iterative method (see [14, 15]). In this paper, we establish a Kantorovich-type third-order convergence theorem for this kind of method by using majorizing function to improve the result .

## 1 Main results

In this section, we establish a Newton-Kantorovich convergence theorem via majorizing function. Let $g\left(t\right)=\frac{1}{6}K{t}^{3}+\frac{1}{2}\gamma {t}^{2}-t+\eta$, where K, γ, η are positive real numbers. Denote

$\alpha =\frac{2}{\gamma +\sqrt{{\gamma }^{2}+2K}},\phantom{\rule{2em}{0ex}}\beta =\alpha -\frac{1}{6}K{\alpha }^{3}-\frac{1}{2}\gamma {\alpha }^{2}=\frac{2\left(\gamma +2\sqrt{{\gamma }^{2}+2K}\right)}{3{\left(\gamma +\sqrt{{\gamma }^{2}+2K}\right)}^{2}}.$

Theorem 1 Suppose that X and Y are the Banach spaces, and Ω is an open convex subset of X, $F:\mathrm{\Omega }\subset X\to Y$ has the second order Fréchet derivative, ${F}^{\prime }{\left({x}_{0}\right)}^{-1}$ exists for ${x}_{0}\in \mathrm{\Omega }$, and the following conditions hold:

$\begin{array}{r}\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}F\left({x}_{0}\right)\parallel \le \eta ,\phantom{\rule{2em}{0ex}}\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{″}\left({x}_{0}\right)\parallel \le \gamma ,\\ \parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}\left({F}^{″}\left(x\right)-{F}^{″}\left(y\right)\right)\parallel \le N\parallel x-y\parallel ,\phantom{\rule{1em}{0ex}}x,y\in \mathrm{\Omega },\\ \frac{2+3p}{2-3p}N\le K,\phantom{\rule{2em}{0ex}}\eta <\beta ,\\ \overline{S\left({x}_{0},{r}_{1}\right)}\subset \mathrm{\Omega },\end{array}$
(4)

where ${r}_{1}\le {r}_{2}$ are two positive real roots of the function $g\left(t\right)$. Then, for $0, the sequence ${\left\{{x}_{n}\right\}}_{n\ge 0}$ generated by (3) is well defined, ${x}_{n}\in \overline{S\left({x}_{0},{r}_{1}\right)}$ and converges to the unique solution ${x}^{\ast }$ of equation (1) in $S\left({x}_{0},\alpha \right)$.

Theorem 2 Suppose that X and Y are the Banach spaces, and Ω is an open convex subset of X, $F:\mathrm{\Omega }\subset X\to Y$ has the third-order Fréchet derivative, ${F}^{\prime }{\left({x}_{0}\right)}^{-1}$ exists for ${x}_{0}\in \mathrm{\Omega }$, and the following conditions hold:

$\begin{array}{r}\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}F\left({x}_{0}\right)\parallel \le \eta ,\phantom{\rule{2em}{0ex}}\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{″}\left({x}_{0}\right)\parallel \le \gamma ,\phantom{\rule{2em}{0ex}}\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{‴}\left(x\right)\parallel \le N,\\ \parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}\left({F}^{‴}\left(x\right)-{F}^{‴}\left(y\right)\right)\parallel \le L\parallel x-y\parallel ,\phantom{\rule{1em}{0ex}}x,y\in \mathrm{\Omega },\\ \frac{\left(1+2{p}^{2}\right)L}{6\gamma \left(1-p\right)}+N\le K,\phantom{\rule{2em}{0ex}}\eta <\beta ,\\ \overline{S\left({x}_{0},{r}_{1}\right)}\subset \mathrm{\Omega }.\end{array}$
(5)

Then for $0, the sequence ${\left\{{x}_{n}\right\}}_{n\ge 0}$ generated by (3) is well defined, ${x}_{n}\in \overline{S\left({x}_{0},{r}_{1}\right)}$ and converges to the unique solution ${x}^{\ast }$ of equation (1) in $S\left({x}_{0},\alpha \right)$.

To prove Theorems 1 and 2, we first give some lemmas.

Lemma 1 If $\eta \le \beta$, the polynomial $g\left(t\right)$ has two positive real roots ${r}_{1}$, ${r}_{2}$ (let $0<{r}_{1}<{r}_{2}<+\mathrm{\infty }$), and a negative real root $-{r}_{0}$ (${r}_{0}>0$).

Proof From definition of the function $g\left(t\right)$, there follows that $g\left(0\right)=\eta >0$, ${lim}_{t\to -\mathrm{\infty }}g\left(t\right)=-\mathrm{\infty }$, hence $g\left(t\right)$ has a negative root. Denote it $-{r}_{0}$. We get that ${g}^{\prime }\left(t\right)=\frac{1}{2}K{t}^{2}+\gamma t-1$ has the unique positive root $\alpha =\frac{2}{\gamma +\sqrt{{\gamma }^{2}+2K}}$, and for $t\ge 0$, ${g}^{″}\left(t\right)=Kt+\gamma >0$. So, the necessary and sufficient condition that $g\left(t\right)$ has two positive roots for $t\ge 0$ is that the minimum of $g\left(t\right)$ satisfies $g\left(\alpha \right)\le 0$, that is also $\eta \le \beta$. This completes the proof of Lemma 1. □

Lemma 2 (see )

Suppose that the sequences ${\left\{{t}_{n}\right\}}_{n\ge 0}$ and ${\left\{{s}_{n}\right\}}_{n\ge 0}$ are generated by the following iteration ${t}_{0}=0$,

$\left\{\begin{array}{c}{s}_{n}={t}_{n}-{g}^{\prime }{\left({t}_{n}\right)}^{-1}g\left({t}_{n}\right),\hfill \\ {H}_{g}\left({t}_{n},{s}_{n}\right)=\frac{1}{p}{g}^{\prime }{\left({t}_{n}\right)}^{-1}\left[{g}^{\prime }\left({t}_{n}+p\left({s}_{n}-{t}_{n}\right)\right)-{g}^{\prime }\left({t}_{n}\right)\right],\hfill \\ {t}_{n+1}={s}_{n}-\frac{1}{2}{H}_{g}\left({t}_{n},{s}_{n}\right)\left[I-\lambda {H}_{g}\left({t}_{n},{s}_{n}\right)\right]\left({s}_{n}-{t}_{n}\right).\hfill \end{array}$
(6)

Then for $\eta \le \beta$, $\left\{{t}_{n}\right\}$, $\left\{{s}_{n}\right\}$ are increasing and converge to ${r}_{1}$.

Lemma 3 Suppose that $F\left(x\right)$ satisfies conditions (4) of Theorem  1, $\mathrm{\forall }x\in B\left({x}_{0},{r}_{1}\right)$, ${F}^{\prime }{\left(x\right)}^{-1}$ exists and satisfies the following inequalities:

$\begin{array}{ll}\left(\mathrm{I}\right)& \parallel {F}^{\prime }{\left(x\right)}^{-1}{F}^{\prime }\left({x}_{0}\right)\parallel \le -{g}^{\prime }{\left(\parallel x-{x}_{0}\parallel \right)}^{-1},\\ \left(\mathrm{II}\right)& \parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{″}\left(x\right)\parallel \le {g}^{″}\left(\parallel x-{x}_{0}\parallel \right).\end{array}$

Proof

$\begin{array}{rcl}\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{″}\left(x\right)\parallel & =& \parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{″}\left({x}_{0}\right)+{F}^{\prime }{\left({x}_{0}\right)}^{-1}\left[{F}^{″}\left(x\right)-{F}^{″}\left({x}_{0}\right)\right]\parallel \\ \le & \gamma +N\parallel x-{x}_{0}\parallel \le \gamma +K\parallel x-{x}_{0}\parallel ={g}^{″}\left(\parallel x-{x}_{0}\parallel \right).\end{array}$

By the proof process of Lemma 1, we get ${g}^{\prime }\left(t\right)<0$, $t\in \left[0,{r}_{1}\right)$. Hence, for $x\in B\left({x}_{0},{r}_{1}\right)$,

$\begin{array}{c}\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{\prime }\left(x\right)-I\parallel \hfill \\ \phantom{\rule{1em}{0ex}}=\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}\left[{F}^{\prime }\left(x\right)-{F}^{\prime }\left({x}_{0}\right)-{F}^{″}\left({x}_{0}\right)\left(x-{x}_{0}\right)+{F}^{″}\left({x}_{0}\right)\left(x-{x}_{0}\right)\right]\parallel \hfill \\ \phantom{\rule{1em}{0ex}}\le \parallel {\int }_{0}^{1}{F}^{\prime }{\left({x}_{0}\right)}^{-1}\left[{F}^{″}\left({x}_{0}+t\left(x-{x}_{0}\right)\right)-{F}^{″}\left({x}_{0}\right)\right]\phantom{\rule{0.2em}{0ex}}dt\left(x-{x}_{0}\right)\parallel +\gamma \parallel x-{x}_{0}\parallel \hfill \\ \phantom{\rule{1em}{0ex}}\le {\int }_{0}^{1}Nt\phantom{\rule{0.2em}{0ex}}dt{\parallel x-{x}_{0}\parallel }^{2}+\gamma \parallel x-{x}_{0}\parallel \le \frac{1}{2}K{\parallel x-{x}_{0}\parallel }^{2}+\gamma \parallel x-{x}_{0}\parallel \hfill \\ \phantom{\rule{1em}{0ex}}=1+{g}^{\prime }\left(\parallel x-{x}_{0}\parallel \right)<1.\hfill \end{array}$

By the Banach theorem, we know ${\left({F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{\prime }\left(x\right)\right)}^{-1}={F}^{\prime }{\left(x\right)}^{-1}{F}^{\prime }\left({x}_{0}\right)$ exists, and

$\parallel {F}^{\prime }{\left(x\right)}^{-1}{F}^{\prime }\left({x}_{0}\right)\parallel \le \frac{1}{1-\parallel I-{F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{\prime }\left(x\right)\parallel }\le -{g}^{\prime }{\left(\parallel x-{x}_{0}\parallel \right)}^{-1}.$

This completes the proof of Lemma 3. □

Lemma 4 Suppose that the nonlinear operator $F:\mathrm{\Omega }\subset X\to Y$ is defined on an open convex subset Ω of a Banach space X with values in a Banach space Y, F has the second-order Frechét derivative, and the sequences $\left\{{x}_{n}\right\}$, $\left\{{y}_{n}\right\}$ are generated by (3). Then the following formula holds for all natural numbers n:

$\begin{array}{rcl}F\left({x}_{n+1}\right)& =& {\int }_{0}^{1}{F}^{″}\left({y}_{n}+t\left({x}_{n+1}-{y}_{n}\right)\right)\left(1-t\right)\phantom{\rule{0.2em}{0ex}}dt{\left({x}_{n+1}-{y}_{n}\right)}^{2}\\ +{\int }_{0}^{1}\left[{F}^{″}\left({x}_{n}+t\left({y}_{n}-{x}_{n}\right)\right)\left(1-t\right)-\frac{1}{2}{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)\right]\phantom{\rule{0.2em}{0ex}}dt{\left({y}_{n}-{x}_{n}\right)}^{2}\\ -\frac{1-\lambda }{2}{\int }_{0}^{1}{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt\left({y}_{n}-{x}_{n}\right)H\left({x}_{n},{y}_{n}\right)\left({y}_{n}-{x}_{n}\right)\\ -\frac{1}{2}{\int }_{0}^{1}\left[{F}^{″}\left({x}_{n}+t\left({y}_{n}-{x}_{n}\right)\right)-{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)\right]\phantom{\rule{0.2em}{0ex}}dt\left({y}_{n}-{x}_{n}\right)\\ ×H\left({x}_{n},{y}_{n}\right)\left({y}_{n}-{x}_{n}\right)\\ +\frac{\lambda }{2}{\int }_{0}^{1}{F}^{″}\left({x}_{n}+t\left({y}_{n}-{x}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt\left({y}_{n}-{x}_{n}\right)H\left({x}_{n},{y}_{n}\right)H\left({x}_{n},{y}_{n}\right)\left({y}_{n}-{x}_{n}\right).\end{array}$

Proof

$\begin{array}{c}F\left({x}_{n+1}\right)=F\left({x}_{n+1}\right)-F\left({y}_{n}\right)-{F}^{\prime }\left({y}_{n}\right)\left({x}_{n+1}-{y}_{n}\right)+F\left({y}_{n}\right)+{F}^{\prime }\left({y}_{n}\right)\left({x}_{n+1}-{y}_{n}\right)\hfill \\ \phantom{F\left({x}_{n+1}\right)}={\int }_{0}^{1}{F}^{″}\left({y}_{n}+t\left({x}_{n+1}-{y}_{n}\right)\right)\left(1-t\right)\phantom{\rule{0.2em}{0ex}}dt{\left({x}_{n+1}-{y}_{n}\right)}^{2}+F\left({y}_{n}\right)+F\left({y}_{n}^{\prime }\right)\left({x}_{n+1}-{y}_{n}\right),\hfill \\ {F}^{\prime }\left({x}_{n}\right)H\left({x}_{n},{y}_{n}\right)=\frac{1}{p}\left[{F}^{\prime }\left({x}_{n}+p\left({y}_{n}-{x}_{n}\right)\right)-{F}^{\prime }\left({x}_{n}\right)\right]={\int }_{0}^{1}{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt\left({y}_{n}-{x}_{n}\right),\hfill \\ F\left({y}_{n}\right)+F\left({y}_{n}^{\prime }\right)\left({x}_{n+1}-{y}_{n}\right)\hfill \\ \phantom{\rule{1em}{0ex}}=F\left({y}_{n}\right)-F\left({x}_{n}\right)-{F}^{\prime }\left({x}_{n}\right){\left({y}_{n}-{x}_{n}\right)}^{2}-\frac{1}{2}{F}^{\prime }\left({y}_{n}\right)H\left({x}_{n},{y}_{n}\right)\left[I-\lambda H\left({x}_{n},{y}_{n}\right)\right]\left({y}_{n}-{x}_{n}\right)\hfill \\ \phantom{\rule{1em}{0ex}}={\int }_{0}^{1}{F}^{″}\left({x}_{n}+t\left({y}_{n}-{x}_{n}\right)\right)\left(1-t\right)\phantom{\rule{0.2em}{0ex}}dt{\left({y}_{n}-{x}_{n}\right)}^{2}\hfill \\ \phantom{\rule{2em}{0ex}}-\frac{1}{2}\left[{F}^{\prime }\left({y}_{n}\right)-{F}^{\prime }\left({x}_{n}\right)\right]H\left({x}_{n},{y}_{n}\right)\left[I-\lambda H\left({x}_{n},{y}_{n}\right)\right]\left({y}_{n}-{x}_{n}\right)\hfill \\ \phantom{\rule{2em}{0ex}}-\frac{1}{2}{F}^{\prime }\left({x}_{n}\right)H\left({x}_{n},{y}_{n}\right)\left[I-\lambda H\left({x}_{n},{y}_{n}\right)\right]\left({y}_{n}-{x}_{n}\right)\hfill \\ \phantom{\rule{1em}{0ex}}={\int }_{0}^{1}{F}^{″}\left({x}_{n}+t\left({y}_{n}-{x}_{n}\right)\right)\left(1-t\right)\phantom{\rule{0.2em}{0ex}}dt{\left({y}_{n}-{x}_{n}\right)}^{2}-\frac{1}{2}{\int }_{0}^{1}{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt{\left({y}_{n}-{x}_{n}\right)}^{2}\hfill \\ \phantom{\rule{2em}{0ex}}+\frac{\lambda }{2}{\int }_{0}^{1}{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt\left({y}_{n}-{x}_{n}\right)H\left({x}_{n},{y}_{n}\right)\left({y}_{n}-{x}_{n}\right)\hfill \\ \phantom{\rule{2em}{0ex}}-\frac{1}{2}{\int }_{0}^{1}{F}^{″}\left({x}_{n}+t\left({y}_{n}-{x}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt\left({y}_{n}-{x}_{n}\right)H\left({x}_{n},{y}_{n}\right)\left({y}_{n}-{x}_{n}\right)\hfill \\ \phantom{\rule{2em}{0ex}}+\frac{\lambda }{2}{\int }_{0}^{1}{F}^{″}\left({x}_{n}+t\left({y}_{n}-{x}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt\left({y}_{n}-{x}_{n}\right)H\left({x}_{n},{y}_{n}\right)H\left({x}_{n},{y}_{n}\right)\left({y}_{n}-{x}_{n}\right).\hfill \end{array}$

Hence,

$\begin{array}{rcl}F\left({x}_{n+1}\right)& =& {\int }_{0}^{1}{F}^{″}\left({y}_{n}+t\left({x}_{n+1}-{y}_{n}\right)\right)\left(1-t\right)\phantom{\rule{0.2em}{0ex}}dt{\left({x}_{n+1}-{y}_{n}\right)}^{2}\\ +{\int }_{0}^{1}\left[{F}^{″}\left({x}_{n}+t\left({y}_{n}-{x}_{n}\right)\right)\left(1-t\right)-\frac{1}{2}{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)\right]\phantom{\rule{0.2em}{0ex}}dt{\left({y}_{n}-{x}_{n}\right)}^{2}\\ -\frac{1-\lambda }{2}{\int }_{0}^{1}{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt\left({y}_{n}-{x}_{n}\right)H\left({x}_{n},{y}_{n}\right)\left({y}_{n}-{x}_{n}\right)\\ -\frac{1}{2}{\int }_{0}^{1}\left[{F}^{″}\left({x}_{n}+t\left({y}_{n}-{x}_{n}\right)\right)-{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)\right]\phantom{\rule{0.2em}{0ex}}dt\left({y}_{n}-{x}_{n}\right)\\ ×H\left({x}_{n},{y}_{n}\right)\left({y}_{n}-{x}_{n}\right)\\ +\frac{\lambda }{2}{\int }_{0}^{1}{F}^{″}\left({x}_{n}+t\left({y}_{n}-{x}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt\left({y}_{n}-{x}_{n}\right)H\left({x}_{n},{y}_{n}\right)H\left({x}_{n},{y}_{n}\right)\left({y}_{n}-{x}_{n}\right).\end{array}$

This completes the proof of Lemma 4. □

Proof of Theorem 1 By induction, we can prove, for $n\ge 0$, that the following formulae hold:

$\begin{array}{ll}\left({\mathrm{I}}_{n}\right):& {x}_{n}\in \overline{S\left({x}_{0},{t}_{n}\right)},\\ \left({\mathrm{II}}_{n}\right):& \parallel {F}^{\prime }{\left({x}_{n}\right)}^{-1}{F}^{\prime }\left({x}_{0}\right)\parallel \le -{g}^{\prime }{\left({t}_{n}\right)}^{-1},\phantom{\rule{2em}{0ex}}\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{″}\left({x}_{n}\right)\parallel \le {g}^{″}\left({t}_{n}\right),\\ \left({\mathrm{III}}_{n}\right):& \parallel {y}_{n}-{x}_{n}\parallel \le {s}_{n}-{t}_{n},\\ \left({\mathrm{IV}}_{n}\right):& {y}_{n}\in \overline{S\left({x}_{0},{s}_{n}\right)},\\ \left({\mathrm{V}}_{n}\right):& \parallel {x}_{n+1}-{y}_{n}\parallel \le {t}_{n+1}-{s}_{n}.\end{array}$

In fact, by Lemma 2, we know that $\left\{{t}_{n}\right\}$ is increasing and converges to the minimum positive root of the function $g\left(t\right)$. Hence, ${t}_{n}<{r}_{1}$ for all natural numbers n. It is easy to verify it for the case $n=0$. By using mathematical induction, we now suppose the formulae above also hold for $n\ge 0$. Then

$\left({\mathrm{I}}_{n+1}\right):\phantom{\rule{1em}{0ex}}\parallel {x}_{n+1}-{x}_{0}\parallel \le \parallel {x}_{n+1}-{y}_{n}\parallel +\parallel {y}_{n}-{x}_{0}\parallel \le {t}_{n+1}-{s}_{n}+{s}_{n}={t}_{n+1}.$

By Lemma 3, and the fact that $-{g}^{\prime }{\left(t\right)}^{-1}$, ${g}^{″}\left(t\right)$ are increasing on $\left[0,{r}_{1}\right]$, we get (${\mathrm{II}}_{n+1}$).

$\begin{array}{ll}\left({\mathrm{III}}_{n+1}\right):& \parallel {\int }_{0}^{1}{F}^{\prime }{\left({x}_{0}\right)}^{-1}\left[{F}^{″}\left({y}_{n}+t\left({x}_{n+1}-{y}_{n}\right)\right)\left(1-t\right)\phantom{\rule{0.2em}{0ex}}dt{\left({x}_{n+1}-{y}_{n}\right)}^{2}\parallel \\ \phantom{\rule{1em}{0ex}}\le {\int }_{0}^{1}{g}^{″}\left(\parallel {y}_{n}-{x}_{0}+t\left({x}_{n+1}-{y}_{n}\right)\parallel \right)\left(1-t\right)\phantom{\rule{0.2em}{0ex}}dt{\left({t}_{n+1}-{s}_{n}\right)}^{2}\\ \phantom{\rule{1em}{0ex}}\le {\int }_{0}^{1}{g}^{″}\left({s}_{n}+t\left({t}_{n+1}-{s}_{n}\right)\right)\left(1-t\right)\phantom{\rule{0.2em}{0ex}}dt{\left({t}_{n+1}-{s}_{n}\right)}^{2},\\ \parallel {\int }_{0}^{1}{F}^{\prime }{\left({x}_{0}\right)}^{-1}\left[{F}^{″}\left({x}_{n}+t\left({y}_{n}-{x}_{n}\right)\right)\left(1-t\right)-\frac{1}{2}{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)\right]\phantom{\rule{0.2em}{0ex}}dt\parallel \\ \phantom{\rule{1em}{0ex}}\le \parallel {\int }_{0}^{1}{F}^{\prime }{\left({x}_{0}\right)}^{-1}\left[{F}^{″}\left({x}_{n}+t\left({y}_{n}-{x}_{n}\right)\right)-{F}^{″}\left({x}_{n}\right)\right]\left(1-t\right)\phantom{\rule{0.2em}{0ex}}dt\parallel \\ \phantom{\rule{2em}{0ex}}+\frac{1}{2}\parallel {\int }_{0}^{1}{F}^{\prime }{\left({x}_{0}\right)}^{-1}\left[{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)-{F}^{″}\left({x}_{n}\right)\right]\phantom{\rule{0.2em}{0ex}}dt\parallel \\ \phantom{\rule{1em}{0ex}}\le \frac{\left(2+3p\right)N}{12}\parallel {y}_{n}-{x}_{n}\parallel ,\\ {\int }_{0}^{1}{F}^{\prime }{\left({x}_{0}\right)}^{-1}\left[{F}^{″}\left({x}_{n}+t\left({y}_{n}-{x}_{n}\right)\right)-{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)\right]\phantom{\rule{0.2em}{0ex}}dt\\ \phantom{\rule{1em}{0ex}}\le {\int }_{0}^{1}N\left(1-p\right)t\parallel {y}_{n}-{x}_{n}\parallel \phantom{\rule{0.2em}{0ex}}dt\le \frac{\left(1-p\right)N}{2}\left({s}_{n}-{t}_{n}\right),\\ \parallel H\left({x}_{n},{y}_{n}\right)\parallel =\parallel {F}^{\prime }{\left({x}_{n}\right)}^{-1}{\int }_{0}^{1}{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)\left({y}_{n}-{x}_{n}\right)\phantom{\rule{0.2em}{0ex}}dt\parallel \\ \phantom{\parallel H\left({x}_{n},{y}_{n}\right)\parallel }=\parallel {F}^{\prime }{\left({x}_{n}\right)}^{-1}{F}^{\prime }\left({x}_{0}\right){\int }_{0}^{1}{F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)\left({y}_{n}-{x}_{n}\right)\phantom{\rule{0.2em}{0ex}}dt\parallel \\ \phantom{\parallel H\left({x}_{n},{y}_{n}\right)\parallel }\le -{g}^{\prime }{\left({t}_{n}\right)}^{-1}{\int }_{0}^{1}{g}^{″}\left({t}_{n}+pt\left({s}_{n}-{t}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt\left({s}_{n}-{t}_{n}\right)=-{H}_{g}\left({t}_{n},{s}_{n}\right).\end{array}$

From Lemmas 3 and 4, we get

$\begin{array}{rcl}\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}F\left({x}_{n+1}\right)\parallel & \le & {\int }_{0}^{1}{g}^{″}\left({s}_{n}+t\left({t}_{n+1}-{s}_{n}\right)\right)\left(1-t\right)\phantom{\rule{0.2em}{0ex}}dt{\left({t}_{n+1}-{s}_{n}\right)}^{2}+\frac{\left(2+3p\right)N}{12}{\left({s}_{n}-{t}_{n}\right)}^{3}\\ +\frac{1-\lambda }{2}{\int }_{0}^{1}{g}^{″}\left({t}_{n}+pt\left({s}_{n}-{t}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt\left(-{H}_{g}\left({t}_{n},{s}_{n}\right)\right){\left({s}_{n}-{t}_{n}\right)}^{2}\\ +\frac{1}{2}\frac{\left(1-p\right)N}{2}\left(-{H}_{g}\left({t}_{n},{s}_{n}\right)\right){\left({s}_{n}-{t}_{n}\right)}^{3}\\ +\frac{\lambda }{2}{\int }_{0}^{1}{g}^{″}\left({t}_{n}+t\left({s}_{n}-{t}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt{\left({s}_{n}-{t}_{n}\right)}^{2}{H}_{g}^{2}\left({t}_{n},{s}_{n}\right)\\ \le & {\int }_{0}^{1}{g}^{″}\left({s}_{n}+t\left({t}_{n+1}-{s}_{n}\right)\right)\left(1-t\right)\phantom{\rule{0.2em}{0ex}}dt{\left({t}_{n+1}-{s}_{n}\right)}^{2}\\ +\frac{\left(2-3p\right)}{12}\cdot \frac{2+3p}{2-3p}N{\left({s}_{n}-{t}_{n}\right)}^{3}\\ -\frac{1-\lambda }{2}{\int }_{0}^{1}{g}^{″}\left({t}_{n}+pt\left({s}_{n}-{t}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt{H}_{g}\left({t}_{n},{s}_{n}\right){\left({s}_{n}-{t}_{n}\right)}^{2}\\ -\frac{1}{2}\frac{\left(1-p\right)K}{2}{H}_{g}\left({t}_{n},{s}_{n}\right){\left({s}_{n}-{t}_{n}\right)}^{3}\\ +\frac{\lambda }{2}{\int }_{0}^{1}{g}^{″}\left({t}_{n}+t\left({s}_{n}-{t}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt{\left({s}_{n}-{t}_{n}\right)}^{2}{H}_{g}^{2}\left({t}_{n},{s}_{n}\right)\\ \le & {\int }_{0}^{1}{g}^{″}\left({s}_{n}+t\left({t}_{n+1}-{s}_{n}\right)\right)\left(1-t\right)\phantom{\rule{0.2em}{0ex}}dt{\left({t}_{n+1}-{s}_{n}\right)}^{2}+\frac{\left(2-3p\right)K}{12}{\left({s}_{n}-{t}_{n}\right)}^{3}\\ -\frac{1-\lambda }{2}{\int }_{0}^{1}{g}^{″}\left({t}_{n}+pt\left({s}_{n}-{t}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt{H}_{g}\left({t}_{n},{s}_{n}\right){\left({s}_{n}-{t}_{n}\right)}^{2}\\ -\frac{1}{2}\frac{\left(1-p\right)K}{2}{H}_{g}\left({t}_{n},{s}_{n}\right){\left({s}_{n}-{t}_{n}\right)}^{3}\\ +\frac{\lambda }{2}{\int }_{0}^{1}{g}^{″}\left({t}_{n}+t\left({s}_{n}-{t}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt{\left({s}_{n}-{t}_{n}\right)}^{2}{H}_{g}^{2}\left({t}_{n},{s}_{n}\right)\\ =& g\left({t}_{n+1}\right).\end{array}$

Hence, we get

$\begin{array}{c}\parallel {y}_{n+1}-{x}_{n+1}\parallel \le \parallel -{F}^{\prime }{\left({x}_{n+1}\right)}^{-1}{F}^{\prime }\left({x}_{0}\right)\parallel \parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}F\left({x}_{n+1}\right)\parallel \hfill \\ \phantom{\parallel {y}_{n+1}-{x}_{n+1}\parallel }\le -{g}^{\prime }{\left({t}_{n+1}\right)}^{-1}g\left({t}_{n+1}\right)={s}_{n+1}-{t}_{n+1},\hfill \\ \begin{array}{ll}\left({\mathrm{IV}}_{n+1}\right):& \parallel {y}_{n+1}-{x}_{0}\parallel \le \parallel {y}_{n+1}-{x}_{n+1}\parallel +\parallel {x}_{n+1}-{x}_{0}\parallel \le \left({s}_{n+1}-{t}_{n+1}\right)+{t}_{n+1}={s}_{n+1},\\ \left({\mathrm{V}}_{n+1}\right):& \parallel {x}_{n+2}-{y}_{n+1}\parallel =\parallel -\frac{1}{2}H\left({x}_{n+1},{y}_{n+1}\right)\left[I-\lambda H\left({x}_{n+1},{y}_{n+1}\right)\right]\left({y}_{n+1}-{x}_{n+1}\right)\parallel \\ \phantom{\parallel {x}_{n+2}-{y}_{n+1}\parallel }\le \frac{1}{2}\parallel H\left({x}_{n+1},{y}_{n+1}\right)\parallel \left[1+\lambda \parallel H\left({x}_{n+1},{y}_{n+1}\right)\parallel \right]\parallel \left({y}_{n+1}-{x}_{n+1}\right)\parallel \\ \phantom{\parallel {x}_{n+2}-{y}_{n+1}\parallel }\le -\frac{1}{2}{H}_{g}\left({t}_{n+1},{s}_{n+1}\right)\left[1-\lambda {H}_{g}\left({t}_{n+1},{s}_{n+1}\right)\right]\left({s}_{n+1}-{t}_{n+1}\right)\\ \phantom{\parallel {x}_{n+2}-{y}_{n+1}\parallel }={t}_{n+2}-{s}_{n+1}.\end{array}\hfill \end{array}$

So, the sequence ${\left\{{x}_{n}\right\}}_{n\ge 0}$ generated by (3) is well defined, ${x}_{n}\in \overline{S\left({x}_{0},{r}_{1}\right)}$ and $\left\{{x}_{n}\right\}$ converges to the solution ${x}^{\ast }$ of equation (1) on $\overline{S\left({x}_{0},{r}_{1}\right)}$. Now, we prove the uniqueness. If ${y}^{\ast }$ is also the solution of equation (1) in $S\left({x}_{0},\alpha \right)$, then, by the proof of Lemma 1, we know that ${g}^{\prime }\left(t\right)<0$, $t\in \left[0,\alpha \right)$.

Thus,

$\begin{array}{c}\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}{\int }_{0}^{1}{F}^{\prime }\left({x}^{\ast }+t\left({y}^{\ast }-{x}^{\ast }\right)\right)\phantom{\rule{0.2em}{0ex}}dt-I\parallel \hfill \\ \phantom{\rule{1em}{0ex}}=\parallel {\int }_{0}^{1}{F}^{\prime }{\left({x}_{0}\right)}^{-1}\left\{{F}^{\prime }\left[{x}^{\ast }+t\left({y}^{\ast }-{x}^{\ast }\right)\right]-{F}^{\prime }\left({x}_{0}\right)\right\}\phantom{\rule{0.2em}{0ex}}dt\parallel \hfill \\ \phantom{\rule{1em}{0ex}}\le {\int }_{0}^{1}{\int }_{0}^{1}\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{″}\left\{{x}_{0}+s\left[{x}^{\ast }+t\left({y}^{\ast }-{x}^{\ast }\right)-{x}_{0}\right]\right\}\parallel \parallel {x}^{\ast }-{x}_{0}+t\left({y}^{\ast }-{x}^{\ast }\right)\parallel \phantom{\rule{0.2em}{0ex}}ds\phantom{\rule{0.2em}{0ex}}dt\hfill \\ \phantom{\rule{1em}{0ex}}\le {\int }_{0}^{1}{\int }_{0}^{1}{g}^{″}\left[s\parallel {x}^{\ast }-{x}_{0}+t\left({y}^{\ast }-{x}^{\ast }\right)\parallel \right]\parallel {x}^{\ast }-{x}_{0}+t\left({y}^{\ast }-{x}^{\ast }\right)\parallel \phantom{\rule{0.2em}{0ex}}ds\phantom{\rule{0.2em}{0ex}}dt\hfill \\ \phantom{\rule{1em}{0ex}}={\int }_{0}^{1}\left\{{g}^{\prime }\left[\parallel {x}^{\ast }-{x}_{0}+t\left({y}^{\ast }-{x}^{\ast }\right)\parallel \right]-{g}^{\prime }\left(0\right)\right\}\phantom{\rule{0.2em}{0ex}}dt\hfill \\ \phantom{\rule{1em}{0ex}}={\int }_{0}^{1}\left\{{g}^{\prime }\left[\parallel \left(1-t\right)\left({x}^{\ast }-{x}_{0}\right)+t\left({y}^{\ast }-{x}_{0}\right)\parallel \right]\right\}\phantom{\rule{0.2em}{0ex}}dt+1<1.\hfill \end{array}$

By the Banach theorem, we know the inverse of

${\int }_{0}^{1}{F}^{\prime }\left[{x}^{\ast }+t\left({y}^{\ast }-{x}^{\ast }\right)\right]\phantom{\rule{0.2em}{0ex}}dt$

exists. Since

$0=F\left({y}^{\ast }\right)-F\left({x}^{\ast }\right)={\int }_{0}^{1}{F}^{\prime }\left[{x}^{\ast }+t\left({y}^{\ast }-{x}^{\ast }\right)\right]\phantom{\rule{0.2em}{0ex}}dt\left({y}^{\ast }-{x}^{\ast }\right),$

we have ${y}^{\ast }={x}^{\ast }$. This completes the proof of uniqueness. Thus, the proof of Theorem 1 is complete. □

Proof of Theorem 2 We know that $F:\mathrm{\Omega }\subset X\to Y$ has the three-order Fréchet derivative. Then

$\begin{array}{c}-{H}_{g}\left({x}_{n},{y}_{n}\right)=-{g}^{\prime }{\left({t}_{n}\right)}^{-1}{\int }_{0}^{1}{g}^{″}\left({t}_{n}+pt\left({s}_{n}-{t}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt\left({s}_{n}-{t}_{n}\right)\hfill \\ \phantom{-{H}_{g}\left({x}_{n},{y}_{n}\right)}=\frac{1}{1-\gamma {t}_{n}-\frac{1}{2}{t}_{n}^{2}}{\int }_{0}^{1}\left[K\left({t}_{n}+pt\left({s}_{n}-{t}_{n}\right)\right)+\gamma \right]\phantom{\rule{0.2em}{0ex}}dt\left({s}_{n}-{t}_{n}\right)\ge \gamma \left({s}_{n}-{t}_{n}\right)>0,\hfill \\ \parallel {\int }_{0}^{1}{F}^{\prime }{\left({x}_{0}\right)}^{-1}\left[{F}^{″}\left({x}_{n}+t\left({y}_{n}-{x}_{n}\right)\right)\left(1-t\right)-\frac{1}{2}{F}^{″}\left({x}_{n}+pt\left({y}_{n}-{x}_{n}\right)\right)\right]\phantom{\rule{0.2em}{0ex}}dt\parallel \hfill \\ \phantom{\rule{1em}{0ex}}\le \parallel {\int }_{0}^{1}{\int }_{0}^{1}{F}^{\prime }{\left({x}_{0}\right)}^{-1}\left[{F}^{‴}\left({x}_{n}+\sigma t\left({y}_{n}-{x}_{n}\right)\right)-{F}^{‴}\left({x}_{n}\right)\right]t\left(1-t\right)\phantom{\rule{0.2em}{0ex}}d\sigma \phantom{\rule{0.2em}{0ex}}dt\parallel \parallel {y}_{n}-{x}_{n}\parallel \hfill \\ \phantom{\rule{2em}{0ex}}+\frac{p}{2}\parallel {\int }_{0}^{1}{\int }_{0}^{1}{F}^{\prime }{\left({x}_{0}\right)}^{-1}\left[{F}^{‴}\left({x}_{n}+p\sigma t\left({y}_{n}-{x}_{n}\right)\right)-{F}^{‴}\left({x}_{n}\right)\right]t\phantom{\rule{0.2em}{0ex}}d\sigma \phantom{\rule{0.2em}{0ex}}dt\parallel \parallel {y}_{n}-{x}_{n}\parallel \hfill \\ \phantom{\rule{2em}{0ex}}+\frac{2-3p}{12}{F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{‴}\left({x}_{n}\right)\parallel {y}_{n}-{x}_{n}\parallel \hfill \\ \phantom{\rule{1em}{0ex}}\le \frac{\left(1+2{p}^{2}\right)L}{24}{\parallel {y}_{n}-{x}_{n}\parallel }^{2}+\frac{2-3p}{12}N\parallel {y}_{n}-{x}_{n}\parallel ,\hfill \\ \frac{\left(1+2{p}^{2}\right)L}{24}{\parallel {y}_{n}-{x}_{n}\parallel }^{4}+\frac{1}{2}\frac{\left(1-p\right)N}{2}\left(-{H}_{g}\left({t}_{n},{s}_{n}\right)\right){\left({s}_{n}-{t}_{n}\right)}^{3}\hfill \\ \phantom{\rule{1em}{0ex}}\le \left[\frac{\left(1+2{p}^{2}\right)L}{6\left(1-p\right)}\frac{\left({s}_{n}-{t}_{n}\right)}{\left(-{H}_{g}\left({t}_{n},{s}_{n}\right)\right)}+N\right]\frac{\left(1-p\right)}{4}\left(-{H}_{g}\left({t}_{n},{s}_{n}\right)\right){\left({s}_{n}-{t}_{n}\right)}^{3}\hfill \\ \phantom{\rule{1em}{0ex}}\le \left[\frac{\left(1+2{p}^{2}\right)L}{6\gamma \left(1-p\right)}+N\right]\frac{\left(1-p\right)}{4}\left(-{H}_{g}\left({t}_{n},{s}_{n}\right)\right){\left({s}_{n}-{t}_{n}\right)}^{3}\hfill \\ \phantom{\rule{1em}{0ex}}\le -\frac{\left(1-p\right)}{4}K{H}_{g}\left({t}_{n},{s}_{n}\right){\left({s}_{n}-{t}_{n}\right)}^{3}.\hfill \end{array}$

Hence,

$\begin{array}{rcl}\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}F\left({x}_{n+1}\right)\parallel & \le & {\int }_{0}^{1}{g}^{″}\left({s}_{n}+t\left({t}_{n+1}-{s}_{n}\right)\right)\left(1-t\right)\phantom{\rule{0.2em}{0ex}}dt{\left({t}_{n+1}-{s}_{n}\right)}^{2}\\ +\frac{\left(1+2{p}^{2}\right)L}{24}{\parallel {y}_{n}-{x}_{n}\parallel }^{4}+\frac{2-3p}{12}N{\parallel {y}_{n}-{x}_{n}\parallel }^{3}\\ +\frac{1-\lambda }{2}{\int }_{0}^{1}{g}^{″}\left({t}_{n}+pt\left({s}_{n}-{t}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt\left(-{H}_{g}\left({t}_{n},{s}_{n}\right)\right){\left({s}_{n}-{t}_{n}\right)}^{2}\\ +\frac{1}{2}\frac{\left(1-p\right)N}{2}\left(-{H}_{g}\left({t}_{n},{s}_{n}\right)\right){\left({s}_{n}-{t}_{n}\right)}^{3}\\ +\frac{\lambda }{2}{\int }_{0}^{1}{g}^{″}\left({t}_{n}+t\left({s}_{n}-{t}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt{\left({s}_{n}-{t}_{n}\right)}^{2}{H}_{g}^{2}\left({t}_{n},{s}_{n}\right)\\ \le & {\int }_{0}^{1}{g}^{″}\left({s}_{n}+t\left({t}_{n+1}-{s}_{n}\right)\right)\left(1-t\right)\phantom{\rule{0.2em}{0ex}}dt{\left({t}_{n+1}-{s}_{n}\right)}^{2}+\frac{\left(2-3p\right)}{12}K{\left({s}_{n}-{t}_{n}\right)}^{3}\\ -\frac{1-\lambda }{2}{\int }_{0}^{1}{g}^{″}\left({t}_{n}+pt\left({s}_{n}-{t}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt{H}_{g}\left({t}_{n},{s}_{n}\right){\left({s}_{n}-{t}_{n}\right)}^{2}\\ -\frac{\left(1-p\right)K}{4}{H}_{g}\left({t}_{n},{s}_{n}\right){\left({s}_{n}-{t}_{n}\right)}^{3}\\ +\frac{\lambda }{2}{\int }_{0}^{1}{g}^{″}\left({t}_{n}+t\left({s}_{n}-{t}_{n}\right)\right)\phantom{\rule{0.2em}{0ex}}dt{\left({s}_{n}-{t}_{n}\right)}^{2}{H}_{g}^{2}\left({t}_{n},{s}_{n}\right)=g\left({t}_{n+1}\right).\end{array}$

Using the same proof method as in Theorem 1, we get assertion of Theorem 2. □

## 2 Numerical examples

In this section, we apply the convergence ball result and show two numerical examples.

Example 1 Suppose that $F\left(x\right)=\frac{1}{6}{x}^{3}+\frac{1}{6}{x}^{2}-\frac{5}{6}x+\frac{1}{3}=0$, we consider initial point ${x}_{0}=0$, $\mathrm{\Omega }=\left[-1,1\right]$. We can choose

$\eta =\gamma =\frac{2}{5},\phantom{\rule{2em}{0ex}}N=\frac{6}{5},\phantom{\rule{2em}{0ex}}L=0.$
(7)

Hence,

$K=N=\frac{6}{5},\phantom{\rule{2em}{0ex}}\beta =\frac{2\left(\gamma +2\sqrt{{\gamma }^{2}+2K}\right)}{3{\left(\gamma +\sqrt{{\gamma }^{2}+2K}\right)}^{2}}=\frac{3}{5},\phantom{\rule{2em}{0ex}}\eta <\beta .$
(8)

Moreover, by Theorem 2, we get that the sequence ${x}_{n}$ ($n\ge 0$) generated by (3) is well defined and convergent.

Example 2 Consider the following integral equations

$x\left(s\right)=1+\frac{1}{4}x\left(s\right){\int }_{0}^{1}\frac{s}{s+t}x\left(t\right)\phantom{\rule{0.2em}{0ex}}dt$
(9)

and the space $X=C\left[0,1\right]$ with the norm

$\parallel x\parallel =\underset{0\le s\le 1}{max}|x\left(s\right)|.$
(10)

This equation arises in the theory of the radiative transfer, neutron transport and in the kinetic theory of gasses. Let us define the operator F on X by

$F\left(x\right)=\frac{1}{4}x\left(s\right){\int }_{0}^{1}\frac{s}{s+t}x\left(t\right)\phantom{\rule{0.2em}{0ex}}dt-x\left(s\right)+1.$
(11)

Then, for ${x}_{0}=1$, we get the following results:

$\begin{array}{c}N=0,\phantom{\rule{2em}{0ex}}L=0,\phantom{\rule{2em}{0ex}}K=0,\phantom{\rule{2em}{0ex}}\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}\parallel =1.5304,\hfill \\ \eta =\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{\prime }\left({x}_{0}\right)\parallel =0.2652,\hfill \\ \gamma =\parallel {F}^{\prime }{\left({x}_{0}\right)}^{-1}{F}^{″}\left({x}_{0}\right)\parallel =1.5304×2\cdot \frac{1}{4}\underset{0\le s\le 1}{max}|{\int }_{0}^{1}\frac{s}{s+t}\phantom{\rule{0.2em}{0ex}}dt|=0.5303,\hfill \\ \frac{2\left(\gamma +2\sqrt{{\gamma }^{2}+2K}\right)}{3{\left(\gamma +\sqrt{{\gamma }^{2}+2K}\right)}^{2}}=0.9429>\eta ,\hfill \end{array}$

this means that the hypotheses of Theorem 2 hold.

## References

1. Ortega JM, Rheinbolt WC: Iterative Solution of Nonlinear Equations in Several Variables. Academic Press, New York; 1970.

2. Cordero A, Hueso JL, Martinez E, Torregrosa JR: Increasing the convergence order of an iterative method for nonlinear systems. Appl. Math. Lett. 2012, 25: 2369-2374. 10.1016/j.aml.2012.07.005

3. Babajee DKR, Cordero A, Soleymani F, Torregrosa JR: On a novel fourth-order algorithm for solving systems of nonlinear equations. J. Appl. Math. 2012., 2012: Article ID 165452

4. Cordero A, Torregrosa JR, Vindel P: Study of the dynamics of third-order iterative methods on quadratic polynomials. Int. J. Comput. Math. 2012, 89: 1826-1836. 10.1080/00207160.2012.687446

5. Soleymani F: Some efficient seventh-order derivative-free families in root-finding. Opusc. Math. 2013, 33: 163-173. 10.7494/OpMath.2013.33.1.163

6. Soleymani F, Karimi Vanani S: A modified eighth-order derivative-free root solver. Thai J. Math. 2012, 10: 541-549.

7. Kantorovich L: On Newton method. Tr. Mat. Inst. Steklova 1949, 28: 104-144.

8. Wu QB, Zhao YQ: Third-order convergence theorem by using majorizing function for a modified Newton method in Banach space. Appl. Math. Comput. 2006, 175: 1515-1524. 10.1016/j.amc.2005.08.043

9. Ezquerro JA, Hemández MA: On the R-order of Halley method. J. Math. Anal. Appl. 2005, 303: 591-601. 10.1016/j.jmaa.2004.08.057

10. Gutierrez JM, Hernandez MA: Recurrence relations for the Super-Halley method. Comput. Math. Appl. 1998, 36: 1-8.

11. Wu QB, Zhao YQ: Newton-Kantorovich-type convergence theorem for a family of new deformed Chebyshev method. Appl. Math. Comput. 2007, 192: 405-412. 10.1016/j.amc.2007.03.018

12. Guo X: The convergence for the second-order-derivative-free iterations. J. Eng. Math. 2001, 18: 29-34. (in Chinese)

13. Argyros IK, Chen D: Results on the Chebyshev method in Banach spaces. Proyecciones 1993, 12: 119-128.

14. Ezquerro JA, et al.: The application of an inverse-free Jarratt-type approximation to nonlinear integral equations of Hammersteintype. Comput. Math. Appl. 1998, 36: 9-20.

15. Argyros IK: A new convergence theorem for the Jarratt method in Banach space. Comput. Math. Appl. 1998, 36: 13-18.

16. Ezquerro JA, Gonzalez D, Hernandez MA: A modification of the classic conditions of Newton-Kantorovich for Newton’s method. Math. Comput. Model. 2013, 57: 584-594. 10.1016/j.mcm.2012.07.015

17. Ezquerro JA, Gonzalez D, Hernandez MA: A variant of the Newton-Kantorovich theorem for nonlinear integral equations of mixed Hammerstein type. Appl. Math. Comput. 2012, 218: 9536-9546. 10.1016/j.amc.2012.03.049

18. Homeier HHH: A modified Newton method with cubic convergence: the multivariate case. J. Comput. Appl. Math. 2004, 169: 161-169. 10.1016/j.cam.2003.12.041

19. Weerakoon S, Fernando TGI: A variant of Newton’s method with accelerated third-order convergence. Appl. Math. Lett. 2000, 13: 87-93.

20. Frontini M, Sormani E: Some variant of Newton’s method with third-order convergence. Appl. Math. Comput. 2003, 140: 419-426. 10.1016/S0096-3003(02)00238-2

21. Kou JS, Li YT, Wang XH: On modified Newton methods with cubic convergence. Appl. Math. Comput. 2006, 176: 123-127. 10.1016/j.amc.2005.09.052

22. Chen M, Khan Y, Wu Q, Yildirim A: Newton-Kantorovich convergence theorem of a modified Newton’s method under the gamma-condition in a Banach space. J. Optim. Theory Appl. 2013. 10.1007/s10957-012-0237-9

23. Chen D, Argyros IK, Qian QS: A note on the Halley method in Banach spaces. Appl. Math. Comput. 2004, 58: 215-224.

24. Argyros IK: The super-Halley method using divided differences. Appl. Math. Lett. 1997, 10: 91-95.

25. Gutiérrez JM, Hernández MA: Recurrence relations for the super-Halley method. Comput. Math. Appl. 1998, 36: 1-8.

26. Hernández MA, Salanova MA: Indices of convexity and concavity: application to Halley method. Appl. Math. Comput. 1999, 103: 27-49. 10.1016/S0096-3003(98)10047-4

27. Ezquerro JA, Hernández MA: A modification of the super-Halley method under mild differentiability conditions. J. Comput. Appl. Math. 2000, 114: 405-409. 10.1016/S0377-0427(99)00348-9

28. Ezquerro JA, Hernández MA: On the R-order of the Halley method. J. Comput. Appl. Math. 2005, 303: 591-601.

29. Kou JS, Li YT, Wang XH: Modified Halley’s method free from second derivative. Appl. Math. Comput. 2006, 183: 704-708. 10.1016/j.amc.2006.05.097

30. Gutiérrez JM, Hernández MA: An acceleration of Newton’s method: super-Halley method. J. Appl. Math. Comput. 2001, 117: 223-239. 10.1016/S0096-3003(99)00175-7

## Acknowledgements

This work is supported by the National Basic Research 973 Program of China (No. 2011JB105001), the National Natural Science Foundation of China (Grant No. 11371320), the Foundation of Science and Technology Department (Grant No. 2013C31084) of Zhejiang Province and the Foundation of the Education Department (No. 20120040, Y201329420) of Zhejiang Province of China and by the Grant FEKT-S-11-2-921 of Faculty of Electrical Engineering and Communication, Brno University of Technology, Czech Republic.

## Author information

Authors

### Corresponding author

Correspondence to Rongfei Lin.

### Competing interests

The authors declare that they have no competing interests.

### Authors’ contributions

The authors have made the same contribution. All authors read and approved the final manuscript.

## Rights and permissions

Reprints and Permissions

Lin, R., Zhao, Y., Šmarda, Z. et al. Newton-Kantorovich convergence theorem of a new modified Halley’s method family in a Banach space. Adv Differ Equ 2013, 325 (2013). https://doi.org/10.1186/1687-1847-2013-325

• Accepted:

• Published:

• DOI: https://doi.org/10.1186/1687-1847-2013-325

### Keywords

• Banach Space
• Iterative Method 