A note on the complexity of a phaseless polynomial interpolation

A note on the complexity of a phaseless polynomial interpolation

Journal of Complexity xxx (xxxx) xxx Contents lists available at ScienceDirect Journal of Complexity journal homepage: www.elsevier.com/locate/jco ...

623KB Sizes 1 Downloads 96 Views

Journal of Complexity xxx (xxxx) xxx

Contents lists available at ScienceDirect

Journal of Complexity journal homepage: www.elsevier.com/locate/jco

A note on the complexity of a phaseless polynomial interpolation✩ ∗

Michał R. Przybyłek a , , Paweł Siedlecki b a

Institute of Informatics, Department of Mathematics, Informatics and Mechanics, University of Warsaw, Warsaw, Poland b Institute of Applied Mathematics and Mechanics, Department of Mathematics, Informatics and Mechanics, University of Warsaw, Warsaw, Poland

article

info

Article history: Received 19 August 2019 Received in revised form 23 October 2019 Accepted 19 November 2019 Available online xxxx Keywords: Information-based complexity Absolute value information Group action Polynomial interpolation

a b s t r a c t In this paper we revisit the classical problem of polynomial interpolation, with a slight twist; namely, polynomial evaluations are available up to a group action of the unit circle on the complex plane. It turns out that this new setting allows for a phaseless recovery of a polynomial in a polynomial time. © 2019 Elsevier Inc. All rights reserved.

1. Introduction Polynomial interpolation is a classical computational problem considered in numerical mathematics: given a set of points and values find a polynomial of a given degree that assumes given values at given points. A common interpretation is that a polynomial is fitted to some observational data. It is tacitly assumed that a data set is given exactly. Hence if, for instance, we are fitting a polynomial to some measured observable, our data set faithfully represents measured values. However, in some applications (e.g., signal processing, speech recognition, complex quantities processing) our measurements do not faithfully represent all characteristics of a measured signal, namely, they lack a phase. This observation has led to the emergence of a new subject of phase retrieval, which in short can be described as a study of what can be reconstructed if measurements are restricted to magnitudes while phases are lost. More on phase retrieval can be found, e.g., in [2]. ✩ Communicated by E. Novak. ∗ Corresponding author. E-mail addresses: [email protected] (M.R. Przybyłek), [email protected] (P. Siedlecki). https://doi.org/10.1016/j.jco.2019.101456 0885-064X/© 2019 Elsevier Inc. All rights reserved.

Please cite this article as: M.R. Przybyłek and P. Siedlecki, A note on the complexity of a phaseless polynomial interpolation, Journal of Complexity (2019) 101456, https://doi.org/10.1016/j.jco.2019.101456.

2

M.R. Przybyłek and P. Siedlecki / Journal of Complexity xxx (xxxx) xxx

Recently the problem of phase retrieval has inspired a study of problems and algorithms using absolute value information from the point of view of information-based complexity (IBC). While most information-based complexity research concerned problems with information given as evaluations of linear functionals, [4] consider problems and algorithms that rely on information consisting of modules of real or complex valued linear functionals. All polynomials that differ by a phase have the same absolute values at every point, therefore we cannot hope for the full polynomial recovery based on absolute values only and we have to relax the setting. In this paper we study phaseless polynomial interpolation understood as an algorithmic task to construct a polynomial up to a phase, based on its values that are available up to a phase. This means that for a set of absolute values of evaluations of an unknown polynomial f , i.e., |f (x1 )|, . . . , |f (xk )| for some points x1 , . . . , xk , we wish to find uf for some number u with |u| = 1. We analyse the relation between the degree of a polynomial f and the computational effort needed to perform this phaseless interpolation. We show that it is possible to do it in a polynomial time. It turns out that while the result for real polynomials is quite straightforward, the complex case is more subtle; for instance, we cannot rely on evaluations given at arbitrary distinct points. The paper is organized as follows. The Section 2 describes the computational framework for the problems of polynomial identification. Section 3 studies phaseless identification of real polynomials, whereas Section 4 investigates phaseless identification of complex polynomials. We conclude the paper in Section 5 and indicate areas of further work. 2. Computational framework and problem statement To fix the terminology let us recall some general notions of IBC. Since in this work we are dealing with problems that are meant to be solved exactly, we will not need much besides basic IBC notions. We are interested in identification problems, i.e., problems of the form idV : V → V and their solutions that consist of an information operator N: V → K∗ and an algorithm φ : K∗ → V such that φ ◦ N = idV . Here V is a set, K is either the field of real numbers R or the field of complex numbers C, and K∗ is the set of all finite sequences of elements of K (the unique sequence of length 0, i.e. the empty sequence, will be denoted by ∅). Let Λ be some class of elementary information operations that can act on elements of V , i.e., elements of Λ are maps V → K. Recall that, following common IBC convention, an information operator is a mapping N: V → K∗ such that for all v ∈ V we have N(v ) = L∅ (v ), Ly1 (v ), . . . , Ly1 ,...,yn(v)−1 (v )

(

)

where L(−) : K∗ → Λ and yk = Ly1 ,...,yk−1 (v ) for k ∈ {1, . . . , n(v )}. At every step of a construction of N(v ) the decision whether to evaluate another elementary information operation or stop the computation of N(v ) is based only on previous evaluations. Note that the number n(v ) of evaluations depends on v via the sequence of intermediate values of elementary information operations y1 , . . . , yk . More on information operators and their role in IBC in more general settings can be found, e.g., in [5] and [3]. Remark 2.1 (Representations). All objects that we will deal with can be described as finite sequences of real numbers as follows. We treat complex numbers as pairs of real numbers via the usual correspondence z ↔ (Re(z), Im(z)). Similarly, we treat real polynomials as finite sequences of reals via a0 + a1 x + · · · + an xn ↔ (a0 , a1 . . . , an ), and complex polynomials as finite sequences of reals via a0 + a1 x + · · · + an xn ↔ (Re(a0 ), Im(a0 ), Re(a1 ), Im(a1 ), . . . , Re(an ), Im(an )). For this reason, we assume that K, V ⊆ R∗ and our algorithms induce partial functions φ : R∗ → R∗ . For our computational √ model we choose the real Blum–Shub–Smale machine augmented with the an algorithm has to be realizable in this model. ability to compute (·) and sin(·) and assume that√ We will also, by a slight abuse of the term, call (·) and sin(·) arithmetic operations. More on Blum–Shub–Smale model of computation over the reals can be found in [1].

Please cite this article as: M.R. Przybyłek and P. Siedlecki, A note on the complexity of a phaseless polynomial interpolation, Journal of Complexity (2019) 101456, https://doi.org/10.1016/j.jco.2019.101456.

M.R. Przybyłek and P. Siedlecki / Journal of Complexity xxx (xxxx) xxx

3

We say that a class Λ of elementary information operations is an effective information class, if there is a surjection

γ Λ : K∗ → Λ and a BSS-computable function1 evalΛ : K∗ × V → K such that evalΛ (y, v) = γΛ (y)(v ) for all y ∈ K∗ and v ∈ V . We will say that γΛ is a parametrization of the class Λ, and that evalΛ is an evaluator of the class Λ with a parametrization γΛ . We say that an information operator N: V → K∗ using elementary information operations from an effective information class Λ is an effective information operator, if N is a BSS-computable function, and there is a BSS-computable function ψN : K∗ → K∗ such that for every v ∈ V and 0 ≤ k < n(v ) we have Ly1 ,...,yk (v ) = evalΛ (ψN (y1 , . . . , yk ), v ) Moreover, we assume that at the kth step of a computation of N(v ) the decision whether to evaluate kth elementary information operation or stop the computation of N(v ) is based on ψN in the following way: the computation stops when ψN (y1 , . . . , yk−1 ) = ψN (y1 , . . . , yk−1 , yk ). We will say that ψN is an adaption mechanism of the information operator N. Note that the effectiveness of an information operator N implies that it is possible to algorithmically identify which elementary information operations have been evaluated on an unknown v ∈ V in the process of obtaining N(v ). In the sequel we will use effective information classes and effective information operators. We assume that all arithmetic operations on reals are allowed and are performed in a constant time. All described algorithms make use only of the possibility to store and access finite sequences of real numbers, perform arithmetic operations on them and do bounded iteration. Hence the cost of an algorithm is understood as the (maximal) number of real arithmetic operations performed during its execution. We define the computational cost of a solution (N , φ ) as cost(N , φ ) = sup (costi (N , v ) + costa (φ, N(v ))) v∈V

where for v ∈ V and y ∈ K∗ costi (N , v ) = n(v ) costa (φ, y) = the number of arithmetic operations performed by an algorithm φ on an input y The computational complexity of an identification problem idV : V → V is defined as comp(idV ) = inf{cost(N , φ ): (N , φ ) is a solution of idV } In the remainder of this section V will denote a finite dimensional linear space. We will assume that V is represented as a subset of R∗ in such a way, that the addition of vectors and the multiplication of vectors by scalars are BSS-computable functions. For a real or complex linear space V , let ≡ be a binary relation on V defined as x ≡ y iff x = uy for some number u with |u| = 1 Clearly, ≡ is an equivalence relation. We will write V for V /≡. For every x ∈ V its abstraction class with regard to the equivalence relation ≡ will be denoted by [x]≡ . We assume that equivalence 1 Under identification of K and V with subsets of R∗ as described in Remark 2.1.

Please cite this article as: M.R. Przybyłek and P. Siedlecki, A note on the complexity of a phaseless polynomial interpolation, Journal of Complexity (2019) 101456, https://doi.org/10.1016/j.jco.2019.101456.

4

M.R. Przybyłek and P. Siedlecki / Journal of Complexity xxx (xxxx) xxx

classes are represented by their members. Therefore, in order to define an algorithm φ : K∗ → V it is sufficient to define an algorithm φ : K∗ → V and set φ (y) = [φ (y)]≡ for all y ∈ K∗ . Suppose that Λ is some class of elementary information operations consisting of linear functionals. We define a new class of information operations

|Λ| = {|L|: L ∈ Λ} where |L|(v ) = |L(v )| for v ∈ V , here |·| is an absolute value in case of real spaces, and modulus for complex spaces. Note that elements of |Λ| act on V in a natural way since |L(uv )| = |L(v )| for all v ∈ V and all numbers u such that |u| = 1; hence we define |L| ([v]≡ ) = |L|(v ) for every L ∈ Λ and an equivalence class [v]≡ ∈ V . If Λ is an effective information class with a parametrization γΛ and an evaluator evalΛ ; then |Λ| is also an effective information class with a parametrization γ|Λ| defined as γ|Λ| (y)(v ) = |γΛ (y)(v )|, and an evaluator eval|Λ| defined as eval|Λ| (y, v ) = |evalΛ (y, v )|. We consider two identification problems related to the classes Λ and |Λ|, correspondingly idV : V → V and idV : V → V The problem of identification of v ∈ V based on values L1 (v ), L2 (v ), . . . , Ln (v ) for some Li ∈ Λ is the exact identification with exact evaluations of Li . This is the problem idV when the class Λ is available. In the setting of phase retrieval, one instead tries to identify v ∈ V up to a unit u (i.e. a scalar u such that |u| = 1) based on absolute values |L1 (v )|, |L2 (v )|, . . . , |Ln (v )|. This is the problem idV when the class |Λ| is available. For technical reasons we will now introduce another information class, namely |Λ|+1 . The class |Λ|+1 consists of all elementary information operations from the class |Λ|, but is augmented with one additional operation: the lifting of a modulus from one chosen evaluation from |Λ|. More formally; if the information operator N is using the class |Λ|+1 then it performs, on any element v ∈ V , a sequence of evaluations |L1 (v )|, |L2 (v )|, . . . , |Ln (v )|, possibly in an adaptive way. After all evaluations have been obtained, an index k ∈ {1, 2, . . . , n} is chosen, and this choice depends only on the obtained sequence of values in an effective way. This means that there is a BSS-computable function (in the sense of Remark 2.1) indexN : K∗ → N such that the index to be chosen is k = indexN (|L1 (v )|, |L2 (v )|, . . . , |Ln (v )|) Next, the value Lk (v ) is revealed. After that the computation of N(v ) terminates with the following value N(v ) = (|L1 (v )|, . . . , |Lk−1 (v )|, Lk (v ), |Lk+1 (v )| . . . , |Ln (v )|) We assume that the lifting of a modulus from |Lk (v )| to reveal the value of Lk (v ) does not generate any additional cost, i.e., costi (N , v ) = n. Lemma 2.1 (Reductions). There exists a solution (N , φ) of the idV : V → V that uses information ( problem ) operations from the class |Λ|+1 iff there exists a solution N , φ of the problem idV : V → V that uses information operations from the ( class |)Λ|. Moreover, if any of the mentioned solutions exist we can ensure that costi (N , v) = costi N , [v]≡ for all v ∈ V . Proof. Suppose that (N , φ) is a solution of idV : V → V that uses information operations from the class |Λ|+1 . Observe that N induces an information operator N acting on V , and using information operations from the class |Λ|. Indeed, N can be defined as performing the same evaluations as N, this time acting on V instead of V , but skipping the modulus lifting phase. We define φ = φ . Observe that φ ◦ N = idV . Indeed, take any [v]≡ ∈ V . Note that

( ) φ N([v]≡ ) = φ (N(uv )) = uv for some scalar u with |u| = 1. Now, uv is a member of the equivalence class [v]≡ and thus N , φ ( ) is a solution of idV . Obviously costi (N , v) = costi N , [v]≡ for all v ∈ V .

(

)

Please cite this article as: M.R. Przybyłek and P. Siedlecki, A note on the complexity of a phaseless polynomial interpolation, Journal of Complexity (2019) 101456, https://doi.org/10.1016/j.jco.2019.101456.

M.R. Przybyłek and P. Siedlecki / Journal of Complexity xxx (xxxx) xxx

5

Conversely, let N , φ be a solution of idV : V → V that uses information operations from the class |Λ|. We will define an information operator N acting on V , and using information operations from the class |Λ|+1 . Since for every v ∈ V we have v ∈ [v]≡ , and by definition |L| ([v]≡ ) = |L|(v ) for every L ∈ Λ, the information operator N can be considered as acting on V in a natural way. Note that the function

(

)

last: K∗ → N where last(y) is defined as the index of the last non-zero element of y ∈ K∗ if y contains any, and 1 otherwise; is BSS-computable. Now, the computation of N(v ) proceeds as follows: first, y = N(v ) is computed; then k = last(y) is being found; and finally the modulus from yk is being lifted. We will now specify an algorithm φ . For every y = (y1 , . . . , yn ) ∈ K∗ , if y contains only zeros, then φ (y) = 0 ∈ V ; otherwise

φ (y) = φ (|y|)

ylast(y) evalΛ ψN (y1 , . . . , ylast(y)−1 ), φ (|y|)

(

)

where ψN is the adaption mechanism of N, and |y| is defined componentwise, i.e. |(y1 , y2 , . . . , yn ) | = (|y1 |, |y2 |, . . . , |yn |). Note that for every v ∈ V we have φ (N(v )) = uv for some scalar u with |u| = 1. If for any L ∈ Λ such that L(v ) ̸= 0 we know the values L(v ) and L(uv ), then we may φ (N(v )) L(uv ) compute u = L(v ) , and trivially: v = . This shows that (N , φ ) is a solution of idV . Again, u ( ) obviously costi (N , v) = costi N , [v]≡ for all v ∈ V . □ For any field K, let Kn [x] = {a0 + a1 x + · · · + an xn : a0 , a1 , . . . , an ∈ K} be the linear space of all polynomials of degree at most n (n ∈ N) and coefficients from K, and let Kn [x] = Kn [x]/≡. Note that for every f ∈ Kn [x] and s ∈ K the evaluation of f at s is well defined as Ls (f ) = a0 + a1 s + · · · + an sn We consider K being the field of real numbers, some subfield of it, or the field of complex numbers. We assume that the domain of any polynomial from Kn [x] is: R, if K is a subfield of R C, if K = C We will consider two basic classes of elementary information operations acting on spaces of polynomials:

Λstd = {f ↦→ f (x): for some x in the domain of f }

|Λstd | = {f ↦→ |f (x)|: for some x in the domain of f } In the sequel we will use algorithms that may rely on information operations from the class Λstd ∪ |Λstd |, we will always make it explicit how many operations from each of the classes have been used by an algorithm. We will call the usage of an operation from Λstd an exact evaluation, while the usage of an operation from |Λstd | will be called a phaseless (or signless in the real case) evaluation. Our aim is to investigate the computational complexity of the problem idKn [x] : Kn [x] → Kn [x] when |Λstd | is available, or equivalently of the problem idKn [x] : Kn [x] → Kn [x] if we can use any number of information operations from |Λstd | and exactly one operation from Λstd . Please cite this article as: M.R. Przybyłek and P. Siedlecki, A note on the complexity of a phaseless polynomial interpolation, Journal of Complexity (2019) 101456, https://doi.org/10.1016/j.jco.2019.101456.

6

M.R. Przybyłek and P. Siedlecki / Journal of Complexity xxx (xxxx) xxx

3. Real polynomials This section deals with phaseless identification of real polynomials. We shall start with the following observation. Theorem 3.1. The computational complexity of the problem idRn [x] : Rn [x] → Rn [x] for the class |Λstd | is polynomial in n. Furthermore, it has a solution (N , φ ) such that N is non-adaptive and costi (N , v ) = 2n + 1 for all v ∈ Rn [x], and sup{costa (φ, y): y ∈ N(Rn [x])} is O(n2 ). Proof. Let us consider the general form of a real polynomial p(x) = a0 + a1 x + a2 x2 + · · · + an xn where a0 , a1 , . . . , an ∈ R. Suppose that 0 ≤ j ≤ n is the smallest index such that aj is non-zero. The square of the absolute value of p is

|p(x)|2 = (xj (aj + aj+1 x + aj+2 x2 + · · · + an xn−j ))2 = x2j (A0 + A1 x + A2 x2 + · · · + A2(n−j) x2(n−j) ) for some A0 , A1 , . . . , A2(n−j) ∈ R. Because |p(x)|2 is a real polynomial of degree at most 2n we may determine its coefficients Ak by using its evaluations at (any) 2n + 1 distinct real points and performing polynomial interpolation. Furthermore, every evaluation of |p(x)|2 is of the form (L(p))2 , where L ∈ |Λstd |. Now, observe that

{ Ak =

a2j

if k = 0

2aj aj+k +

∑k−1 i=1

aj+i aj+k−i

if 1 ≤ k ≤ 2(n − j)

Therefore, assuming (Ak )0≤k≤2(n−j) are given, coefficients am may be computed inductively

⎧ ⎪ ⎨ 0√ ± A0 am = ∑m−j−1 ⎪ ⎩ Am−j − i=1 aj+i am−i 2aj

if m < j if m = j if j + 1 ≤ m ≤ n

Thus, we may recover the polynomial p(x) up to a sign (i.e., the sign of aj ), which completes the proof. The computational cost of the algorithm is the cost of polynomial interpolation plus some linear work. Therefore, it can be bounded by O(n2 ). □ One may wonder if 2n + 1 information operations are needed. The answer is no, provided we are satisfied with algorithms with exponential cost that use an adaptive information. Theorem 3.2. The problem idRn [x] : Rn [x] → Rn [x] has a solution (N , φ ) such that N is an adaptive information operator using information operations from the class |Λstd | and such that costi (N , v ) = n + 2 for all v ∈ Rn [x], and sup{costa (φ, y): y ∈ N(Rn [x])} is exponential in n. Proof. By Lemma 2.1 we will equivalently demonstrate that it is possible to identify a real polynomial by n + 1 signless evaluations and one exact evaluation. Let us consider p ∈ Rn [x] and a sequence x0 , x1 , . . . , xn ∈ R of distinct real numbers. Denote by (2) Bk = {−1, 1}k the kth power of the one-dimensional unit circle and by Bk = {(x, y) ∈ Bk × Bk : x ̸ = y}. For every tuple b ∈ Bn+1 there is exactly one polynomial wb ∈ Rn [x] such that wb (xi ) = bi |p(xi )| Please cite this article as: M.R. Przybyłek and P. Siedlecki, A note on the complexity of a phaseless polynomial interpolation, Journal of Complexity (2019) 101456, https://doi.org/10.1016/j.jco.2019.101456.

M.R. Przybyłek and P. Siedlecki / Journal of Complexity xxx (xxxx) xxx

7

for all 0 ≤ i ≤ n. Therefore, there are exactly 2n+1 polynomials with values, up to a sign, matching those of p in each of the points x0 , x1 , . . . , xn . Moreover, if b ̸ = b′ then the equation wb (x) = wb′ (x) has at most n solutions. It follows that the set

S = {x ∈ R: ∃(b,b′ )∈B(2) wb (x) = wb′ (x)} n+1

is finite, thus does not exhaust the whole R. Therefore, there exists x ∈ R that differentiates any pair of polynomials wb and wb′ , and we may perform one exact evaluation at that x, which will uniquely identify b such that wb = p, and hence also p. Moreover, x can be effectively found in exponential time by a naive algorithm that separates an interval from the set of all zeros of polynomials wb −wb′ computed to any finite precision. □ One may also observe that since S is of Lebesgue measure zero, any point randomly chosen from any non-degenerated interval will almost surely not belong to S. Example 3.1. Consider p ∈ Rn [x] for n = 3. Let us choose {1, 2, 3, 4} for the set of four evaluation points and assume that |p(x)| = 1 for every x ∈ {1, 2, 3, 4}. Fig. 1 shows all possible polynomials of degree at most 3 that satisfy the signless evaluations. The polynomials whose value at 1 is positive (i.e. equal to 1) are p1 (x) = 1 p2 (x) = −x3 + 8x2 − 19x + 13 p3 (x) = x3 − 7x2 + 14x − 7 1 11 p4 (x) = − x3 + 2x2 − x+3 3 3 2 p5 (x) = x − 5x + 5 68 4 x + 15 p6 (x) = − x3 + 10x2 − 3 3 2 31 p7 (x) = x3 − 5x2 + x−5 3 3 26 1 x+7 p8 (x) = − x3 + 3x2 − 3 3 The remaining polynomials are their opposites. Observe that at x = π no two non-opposite polynomials have the same absolute value, because π is a transcendental number (i.e., it is not algebraic) and the coefficients of the above polynomials are rational. Therefore, we can choose x = π for our final evaluation. The above example suggests that some information about coefficients of the polynomials can be used to find evaluation points in a non-adaptive way. Theorem 3.3. Suppose that A is a subfield of the field R of real numbers, such that the field extension A ⊆ R is transcendental. The problem idAn [x] : An [x] → An [x] has a solution (N , φ ) such that N is a non-adaptive information operator using information operations from the class |Λstd | and such that costi (N , v ) = n + 2 for all v ∈ An [x], and sup{costa (φ, y): y ∈ N(Rn [x])} is exponential in n. Proof. We proceed similarly as in the proof of Theorem 3.1. By Lemma 2.1 we will equivalently demonstrate that it is possible to identify a polynomial from An [x] by n + 1 signless evaluations and one exact evaluation. However, this time all evaluation points (including the one to perform an exact evaluation) can be fixed beforehand and used without any need for adaption.

Please cite this article as: M.R. Przybyłek and P. Siedlecki, A note on the complexity of a phaseless polynomial interpolation, Journal of Complexity (2019) 101456, https://doi.org/10.1016/j.jco.2019.101456.

8

M.R. Przybyłek and P. Siedlecki / Journal of Complexity xxx (xxxx) xxx

Fig. 1. All possible 3-polynomials p with |p(x)| = 1 for x ∈ {1, 2, 3, 4}.

Let us consider p ∈ An [x] and a sequence x0 , x1 , . . . , xn ∈ A of distinct elements of A. For every tuple b ∈ {−1, 1}n+1 there is exactly one polynomial wb ∈ An [x] such that wb (xi ) = bi |p(xi )| for all 0 ≤ i ≤ n. This follows from two observations:

• −1A = A, since A is a field • a matrix with entries from A has an inverse over A iff it has an inverse over R Therefore, there are exactly 2n+1 polynomials from An [x] with values, up to a sign, matching those of p in each of the points x0 , x1 , . . . , xn . From the assumption, the set

D = R \ {x ∈ R: ∃w∈An [x] w (x) = 0} is non-empty. By definition any x ∈ D differentiates every pair of distinct polynomials w, v ∈ An [x]. Thus we may perform one exact evaluation at any fixed x ∈ D, which will uniquely identify b such that wb = p, and hence also p. □ Corollary 3.1. The problem idQn [x] : Qn [x] → Qn [x] can be solved using n + 1 signless evaluations at any distinct points x0 , x1 , . . . , xn ∈ Q, and one evaluation at any point x ∈ R such that x is a transcendental number. Remark 3.1. Under assumption of Theorem 3.3 one may solve idAn [x] : An [x] → An [x] with a single exact evaluation at any transcendental (wrt. A) number x. This follows from the observation that the evaluation Lx : An [x] → R at transcendental x is an injection. On the other hand, without further assumptions about A there is no upper bound on the algorithmic cost of the problem (in fact, there may be no algorithm realizable on a Blum–Shub–Smale machine). 4. Complex polynomials Phaseless identification of complex polynomials is much more complicated than in the case of real polynomials. One reason for this increased difficulty is that the cardinality of the unit sphere for complex numbers, i.e. {x ∈ C: |x| = 1}, is continuum; whereas the unit sphere for real numbers, i.e. {x ∈ R: |x| = 1}, is just the 2-element set {−1, 1}. The next theorem exploits this difference.

Please cite this article as: M.R. Przybyłek and P. Siedlecki, A note on the complexity of a phaseless polynomial interpolation, Journal of Complexity (2019) 101456, https://doi.org/10.1016/j.jco.2019.101456.

M.R. Przybyłek and P. Siedlecki / Journal of Complexity xxx (xxxx) xxx

9

Theorem 4.1. There is no solution of the problem idCn [x] : Cn [x] → Cn [x] that relies on the information operator N which uses at most n information operations from the class Λstd and at most n + 1 information operations from the class |Λstd |. (2)

Proof. Let us denote Bk = {x ∈ Ck : ∀0≤i
U = {b − b′ ∈ Cn+1 : b, b′ ∈ Bn+1 }

(

)

is equal to the set

{c ∈ Cn+1 : c ̸= 0 ∧ |c i | ≤ 2 for all 0 ≤ i ≤ n} Therefore, if we define

S = {x′ ∈ Cn : ∃c ∈U wc (x′ i ) = 0 for all 0 ≤ i ≤ n − 1} we see that for x′ ∈ Cn we have x′ ∈ S ⇔ ∃b,b′ ∈B(2) ∀0≤i
We argue that S = Cn . Consider any tuple x′ ∈ Cn and any non-zero polynomial p ∈ Cn [x] for |p(x )| which x′ is a tuple of all its roots. If M = 21 max0≤i≤n |f (xi )| , then the polynomial p/M is non-zero i and satisfies

|(p/M)(xi )| ≤ 2 for all 0 ≤ i ≤ n. |f (xi )| Therefore, there is a non-zero tuple c ∈ Cn+1 such that (p/M)(xi ) = c i |f (xi )| and |c i | ≤ 2 for all 0 ≤ i ≤ n. Since p/M ∈ Cn [x], it follows that p/M = wc , and hence wc (x′ i ) = 0 for all 0 ≤ i ≤ n − 1. It means that x′ ∈ S, so indeed S = Cn . Note that f ∈ {wb ∈ Cn [x]: b ∈ Bn+1 } and for every wb , it holds that

|wb (xi )| = |f (xi )| for all 0 ≤ i ≤ n Now, if x′0 , x′1 , . . . , x′n−1 ∈ C is any sequence of distinct complex numbers, there are some b, b′ ∈ Bn+1 such that b ̸ = b′ and wb (x′i ) = wb′ (x′i ) = f (x′i ) for all 0 ≤ i ≤ n − 1. Since f is equal to at most one of wb or wb′ , we cannot identify f . □ Corollary 4.1. There is no solution of the problem idCn [x] : Cn [x] → Cn [x] that relies on the information operator N using information operations from the class |Λstd | and such that costi (N , v ) ≤ 2n + 1 for all v ∈ Cn [x]. Proof. Since the information |Λstd | is weaker than the information Λstd , it follows from Theorem 4.1 that idCn [x] : Cn [x] → Cn [x] cannot have a solution based on an information operator using exactly one information operation from the class Λstd and 2n information operations from the class |Λstd |. Consequently, by Lemma 2.1, idCn [x] : Cn [x] → Cn [x] cannot have a solution based on an information operator using 2n + 1 information operations from the class |Λstd |. □ Please cite this article as: M.R. Przybyłek and P. Siedlecki, A note on the complexity of a phaseless polynomial interpolation, Journal of Complexity (2019) 101456, https://doi.org/10.1016/j.jco.2019.101456.

10

M.R. Przybyłek and P. Siedlecki / Journal of Complexity xxx (xxxx) xxx

Fig. 2. Plot of wb and wb′ for x ∈ [−0.1, 6.1].

Let us see how Theorem 4.1 works on a simple example. Example 4.1. Let f ∈ Cn [x] and assume that the evaluation points are x = (1, 2, 3, 4) with their corresponding values all equal to 1. Consider (any) triple of points, for example {0, 5, 6} and take any non-zero polynomial whose roots are at these points, for instance p(x) =

1 24

x3 −

11 24

x2 +

5 4

x

The polynomial p maps tuple x = (1, 2, 3, 4) to tuple c =

) , 1, 34 , 13 . Therefore, p(xi ) = ci f (xi ) for ′ all 0 ≤ i < 4 and |ci | ≤ 2 for 0 ≤ i < 4. Let us decompose c as c = b − b , where |bi | = |b′i | = 1 for 0 ≤ i ≤ 3: ( √ √ √ √ ) 5 + 119i 1 + 3i 3 + 55i 1 + 35i b= , , , 12

( ′

b =

2

8

√ −5 + 12

,

2

√ 3i −3 +

,

6

6

√ 119i −1 +

(5

8

√ 55i −1 +

,

35i

)

6

Fig. 2 shows interpolating polynomials wb and wb′ , whereas Fig. 3 shows the plot of |wb |, which is equal to |wb′ | on R. Note that the absolute values of both wb and wb′ are all ones at (1, 2, 3, 4), and the values are equal on (0, 5, 6) (in fact, in this example, the absolute values are equal on the whole real line). The above example shows that if we are not careful enough in choosing the evaluation points, we may not be able to distinguish between the polynomials in any number of evaluations (recall Fig. 3). The next example shows that it is not sufficient to evaluate polynomials at points with different imaginary values. Example 4.2. Consider the following polynomials: p(x) = 2x + 1 q(x) = x + 2

Please cite this article as: M.R. Przybyłek and P. Siedlecki, A note on the complexity of a phaseless polynomial interpolation, Journal of Complexity (2019) 101456, https://doi.org/10.1016/j.jco.2019.101456.

M.R. Przybyłek and P. Siedlecki / Journal of Complexity xxx (xxxx) xxx

11

Fig. 3. Plot of |wb | on R (which is equal to |wb′ | on R).

We have: 2

|p(eix )| = 2eix + 1 = (2 cos(x) + 1)2 + 4 sin2 (x) = 4 cos(x) + 5 2

|q(eix )| = eix + 2 = (cos(x) + 2)2 + sin2 (x) = 4 cos(x) + 5 Therefore, |p(eix )| = |q(eix )|, i.e. the absolute values of p and q on the unit circle are equal. On the other hand, if we know the polynomial in advance, we may always choose n + 2 points such that the absolute values at the chosen points identify the polynomial up to a phase. Example 4.3. Let ω0 , ω1 , . . . , ωn be the sequence of (n + 1)th primitive roots of 1. There is exactly one polynomial w of degree at most n such that: w (0) = 1 and |w (ωi )| = 1 for all 0 ≤ i ≤ n. Consider a polynomial w (x) = a0 + a1 x + a2 x2 + · · · + an xn . The constraint w (0) = 1 says that a0 = 1. Now, let us take the Vandermonde matrix T at ωi . We have to solve the equation: T a = u, where ui = 1 and a0 = 1. Because T is non-singular, this is equivalent to the equation a = T −1 u. 1 The first row of the inverse of the Vandermonde matrix is of the form [ n+ , 1 , . . . , n+1 1 ]. Thus 1 n+1 ∑n 1 we obtain equation: 1 = n+1 i=0 ui , which is satisfied only if ui = 1 for all 0 ≤ i ≤ n. Therefore, a = T −1 1 is the unique solution. Example 4.1 shows that it is not sufficient to evaluate polynomials at any fixed points with the same imaginary value, whereas Example 4.2 shows that it is not sufficient to evaluate polynomials at any fixed points with the same real value. One may wonder if it is sufficient to evaluate polynomials at some fixed set of points whose both real and imaginary values vary. The next theorem gives an affirmative answer to this question. Theorem 4.2. The computational complexity of the problem idCn [x] : Cn [x] → Cn [x] for the class |Λstd | is polynomial in n. Furthermore, it has a solution (N , φ ) such that N is non-adaptive and costi (N , v ) = (2n + 1)2 for all v ∈ C[x], and sup{costa (φ, y): y ∈ N(Cn [x])} is O(n2 log(n)). Proof. Let us consider the general form of a complex polynomial p ∈ Cn [x] p(x) = a0 eα0 i + a1 eα1 i x + a2 eα2 i x2 + · · · + an eαn i xn Please cite this article as: M.R. Przybyłek and P. Siedlecki, A note on the complexity of a phaseless polynomial interpolation, Journal of Complexity (2019) 101456, https://doi.org/10.1016/j.jco.2019.101456.

12

M.R. Przybyłek and P. Siedlecki / Journal of Complexity xxx (xxxx) xxx

with kth coefficient presented in polar coordinates ak eαk i , where ak ≥ 0 and 0 ≤ αk < 2π . Similarly, by using polar coordinates for x ∈ C, we have x = reyi and then p may be presented as p(reyi ) = a0 eα0 i + a1 re(α1 +y)i + a2 r 2 e(α2 +2y)i + · · · + an r n e(αn +ny)i The formula for the (square of) absolute value of p(reyi ) takes the following form: 2

|p(reyi )| = (a0 cos(α0 ) + a1 r cos(α1 + y) + · · · + an r n cos(αn + ny))2 + (a0 sin(α0 ) + a1 r sin(α1 + y) + · · · + an r n sin(αn + ny))2 n n ∑ ∑ bk,m r 2k−m (cos(βk,m ) cos(my) + sin(βk,m ) sin(my)) = m=0 k=m

where:

{ bk,m =

a2k 2ak ak−m

if m = 0 otherwise

βk,m = αk − αk−m

(1) (2)

Let r0 , r1 , r2 , . . . , r2n be a sequence of distinct non-negative real numbers. For every ri let us define function ˆ ri : [−π , π] → R as follows 2

ˆ ri (α ) = |p(ri eα i )|

By the above observation, ˆ ri is a trigonometric polynomial

ˆ r i (α ) =

n ∑

Ari ,m cos(mα ) + Bri ,m sin(mα )

m=0

with: Ari ,0 =

n ∑

bk,0 ri2k

k=0

Ari ,m =

n ∑

bk,m ri2k−m cos(βk,m )

k=m

Bri ,0 = 0 Bri ,m =

n ∑

bk,m ri2k−m sin(βk,m )

k=m

To recover coefficients Ari ,m and Bri ,m we use a standard trick. If we set cri (α ) =

ˆ r i (α ) + ˆ ri (−α ) 2

=

n ∑

Ari ,m cos(mα )

m=0

the coefficients Ari ,m may be recovered by the inverse cosine transform from values cri (ω0 ), cri (ω1 ), cri (ω2 ), . . . , cri (ω2n ) where ωk = knπ . Similarly, setting sri (α ) =

ˆ r i (α ) − ˆ ri (−α ) 2

=

n ∑

Bri ,m sin(mα )

m=1

gives the coefficients Bri ,m by the inverse sine transform from values sri (ω1 ), sri (ω2 ), . . . , sri (ω2n ). Observe that to recover both Ari ,m and Bri ,m we need 2n + 1 evaluations of ˆ ri . Therefore, to recover Ari ,m and Bri ,m at every ri for 0 ≤ i ≤ 2n, we need (2n + 1)2 evaluations of |p|. Now, let us consider the following polynomials of order at most 2n: A0 (r) =

n ∑

bk,0 r 2k

k=0

Please cite this article as: M.R. Przybyłek and P. Siedlecki, A note on the complexity of a phaseless polynomial interpolation, Journal of Complexity (2019) 101456, https://doi.org/10.1016/j.jco.2019.101456.

M.R. Przybyłek and P. Siedlecki / Journal of Complexity xxx (xxxx) xxx

Am (r) =

n ∑

13

bk,m r 2k−m cos(βk,m )

k=m

Bm (r) = 0 Bm (r) =

n ∑

bk,m r 2k−m sin(βk,m )

k=m

By definition Am (ri ) = Ari ,m and Bm (ri ) = Bri ,m , so we know the values of each Am , Bm at 2n+1 distinct points. Therefore, we may recover coefficients bk,0 , bk,m cos(βk,m ) and bk,m sin(βk,m ) by polynomial interpolation. Because ak are non-negative, they are uniquely determined by a2k = bk,0 . From (Eq. (1) we can compute remaining bk,m . Note that now we may compute the values of each pair cos(βk,m ), ) sin(βk,m ) and then: βk,m = αk − αk−m . Fixing α0 fixes the rest of the angles, therefore up to α0 all coefficients of polynomial p are uniquely determined. The computational cost of the naive implementation of the above algorithm is O(n3 ), but we can improve it to O(n2 log(n)) by treating ˆ ri (α ) as a Fourier series and applying to it the Inverse Fast Fourier Transform. □ 5. Summary and further work In the present paper we have investigated the relation between the number of (exact and phaseless) polynomial evaluations and combinatorial cost needed to process it in order to recover an unknown polynomial. However, some questions still remain open. Note that in Theorem 3.2 we have shown that n + 2 (phaseless) polynomial evaluations are sufficient for the (phaseless) recovery of a polynomial, however the combinatorial cost of processing of that information, being in fact the cost of a full search among all possible candidates for a solution, is exponential in n. Hence the natural question arises: can we do it faster? More generally, what is the minimal number of (phaseless) evaluations of an unknown polynomial p such that it is possible to perform (phaseless) identification of p in a polynomial time? It is of interest to investigate this both in the real and the complex case. Acknowledgments This research was supported by the National Science Centre, Poland, under projects 2018/28/C/ST6/00417 (M.R. Przybyłek) and 2017/25/B/ST1/00945 (P. Siedlecki). References [1] L. Blum, F. Cucker, M. Shub, S. Smale, Complexity and Real Computation, in: SpringerLink: Bücher, Springer New York, 2012. [2] K. Jaganathan, Y.C. Eldar, B. Hassibi, Phase retrieval: An overview of recent developments, 2015, CoRR, abs/1510.07713. [3] L. Plaskota, Noisy Information and Computational Complexity, Cambridge University Press, 1996. [4] L. Plaskota, P. Siedlecki, H. Woźniakowski, Absolute value information for IBC problems, J. Complexity 56 (2020) 101427, http://dx.doi.org/https://doi.org/10.1016/j.jco.2019.101427. [5] J.F. Traub, G.W. Wasilkowski, H. Woźniakowski, Information-based Complexity, Academic Press Professional, Inc., San Diego, CA, USA, 1988.

Please cite this article as: M.R. Przybyłek and P. Siedlecki, A note on the complexity of a phaseless polynomial interpolation, Journal of Complexity (2019) 101456, https://doi.org/10.1016/j.jco.2019.101456.